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Foreword 


This manual contains detailed information regarding the design and applications of the AT&T DSP3210 Digital 
Signal Processor that is essential to engineers designing systems using this device. The AT&T DSP3210 Support 
Software Tools, AT&T DSP3210 C-Compiler, and the AT&T DSP3210 Application Library are available to aid in 
developing software for the devices and integrating them into system environments. 


Additional information on the digital signal processor product line is available in the form of manuals, data sheets, 
and application notes. 
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1. INTRODUCTION 


The AT&T DSP3210 brings the power of floating-point signal processing to personal computers and workstations 
opening a wide range of multimedia applications. From its conception, the DSP3210 has been engineered with a 
single focus: enabling advanced multimedia functions on personal computers and workstations. Based on AT&T's 
very successful DSP32C architecture, the DSP3210 breaks new ground in its ability to be easily integrated into 
personal computer and workstation system designs. Particular attention was paid to primary bus interfacing; the 
DSP3210 is compatible with both Intel and Motorola microprocessor signalling. This allows designers to easily 
create low-cost systems using the DSP3210 as a bus-master device. A full bus-level SmartModel * of the 
DSP3210 is offered by Logic Automation for system simulation of designs incorporating the DSP3210. 


In addition to its optimizing C compiler and assembly language support tools, the DSP3210 features a powerful 
real-time operating system, the AT&T Visible Caching Operating System™ ( vcCOSs™), which supports both multi- 
tasking and multiprocessing. The VCOS operating system is an open development environment that allows the 
DSP3210 to use existing system memory in PCs and workstations rather than requiring expensive dedicated 
SRAM. Complete real-time debugging tools are included with the VCOS operating system to speed both 
application and algorithm development. By separating the application and algorithm development phases of 
multimedia software development, development schedules are simplified and shortened . The VCOS operating 
system includes its own multimedia function library complete with speech processing, speech recognition, 
graphics, music processing, and modem functions. In addition to new application software, the DSP3210 has 
access to the large base of existing DSP32C application software. 


This document is a reference guide for the DSP3210 device. It describes the architecture, instruction set, and 
interfacing specifications of the device. The information necessary to write programs for the DSP3210 is also 
given in this manual. Separate documents are available with detailed information on the software and hardware 
development tools. No previous experience with digital signal processors is necessary to use this document, but 
an understanding of digital signal processing concepts and assembly language programming on microprocessors 
or minicomputers is recommended. ° 


1.1 Digital Signal Processing With The DSP3210 


A key element shared by all DSP applications is a large number of repetitive mathematical operations combined 
with extreme memory bandwidth requirements. To perform floating-point arithmetic operations, conventional 
microprocessors and co-processors use software routines and/or microcode. Newer RISC- and CISC-based 
microprocessors offer much higher floating-point performance but lack the memory bandwidth and signal 
processing architecture required for DSP applications. Microprocessors also include many functions unnecessary 
for signal processing resulting in an unacceptable price/performance ratio for high-volume, low-cost signal 
processing applications. 


The DSP3210 architecture combines a hardware floating-point arithmetic unit with an architecture that supports 4 
memory accesses per instruction cycle, allowing the device to perform up to 33 million floating-point operations 
per second (with a clock rate of 66.7 MHz). This raw floating-point performance is combined with the DSP3210's 
signal processing architecture to enable real-time applications such as speech compression/recognition/synthesis, 
3D graphics transformations, high-quality music coding/decoding, image processing, and real-time handwriting 
analysis/recognition. New functions can easily be added to DSP3210-based systems with the addition of 
application software. Multiple DSP3210s can be combined easily to increase system performance. (AT&T's 
VCOS real-time operating system directly supports multiple DSP3210 implementations.) 


The use of floating-point arithmetic in the DSP3210 simplifies application development because it eliminates the 
effects of scaling, normalization, and overflow (which complicate development when using a fixed-point device). 
Also, many existing algorithms developed for use on large computers and array processors (using floating-point 
arithmetic) can be easily converted for use on the DSP3210. Program development is typically 3 to 10 times 
faster on the DSP3210 than similar program development on fixed-point processors. This is especially critical in 
multimedia applications where time-to market is a critical parameter, and a large percentage of application 
development is performed by third-party application developers. In addition to ease of program development, 
32-bit floating-point is often required for advanced algorithm implementations. 


> SmariModelis a registered trademark of Logic Automation. 
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Early in the development of a particular application, one must determine the dynamic range and precision 
requirements for the given algorithm. The following is a comparison of the dynamic range and precision 
attainable for various data word sizes. The floating-point representation of a number in the DSP3210 consists of 
a 24-bit mantissa and an 8-bit exponent. The magnitude of the mantissa is normalized to lie between 1 and 2. 
Thus, the range for the magnitude of a floating-point number N is approximately: 


IN| = (1.x 2-127, 2x 2127) of (5.9x 10 “393.4 x 1038), 
providing an overall dynamic range in excess of 1500 dB. 


Figure 1-1 shows the dynamic range for DSP3210 floating-point numbers as compared with the dynamic range of 
16- and 24-bit fixed-point devices. 
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Figure 1-1 Dynamic Range of Fixed- and Floating-Point DSPs 


The precision of a floating-point number is determined by the size of the mantissa. The DSP3210 floating-point 
format provides 24 bits of precision at all magnitudes within the range of numbers represented by the 24-bit 
mantissa and 8-bit exponent. (Note that the DSP3210 also has eight extra bits, the guard bits, in the mantissa of 
the 40-bit floating-point adder and accumulators. These guard bits provide added precision during accumulation 
operations). In contrast, the precision of a fixed-point format is a function of its word size and the magnitude of 
the given number. Figure 1-2 shows the bits of precision versus magnitude of a number for the DSP3210 floating- 
point format, as well as 16- and 24-bit fixed-point formats. 
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Figure 1-2 Precision of Fixed- and Floating-Point DSPs 


4.2 The AT&T Floating-Point Digital Signal Processor Family 


AT&T was the first to produce floating-point digital signal processors and has continued to play a leadership role 
in the floating-point marketplace. Figure 1-3 shows the evolution of the AT&T family of floating-point digital signal 


processors. 
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Figure 1-3 AT&T Floating-Point DSP Family Evolution 
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1.2.1 The AT&T DSP32 Digital Signal Processor 


In 1984, AT&T developed the first single-chip floating-point programmable digital signal processor, the DSP32. 
The design goals of the DSP32 were high performance and ease of use. A major contributor to these goals was 
the selection of 32-bit floating-point arithmetic for this device. In the DSP32, a highly pipelined architecture allows 
the device to achieve high throughput. This architecture includes a 32-bit floating-point data arithmetic unit (DAU) 
and a 16-bit fixed-point control arithmetic unit (CAU). The DAU incorporates a 32-bit floating-point multiplier, a 
40-bit floating-point adder, and four 40-bit accumulators. The CAU performs 16-bit fixed-point arithmetic and logic 
operations, and provides data move and control capabilities. This unit includes 21 general-purpose registers. 
The DSP32 processes over six million instructions per second. All instructions execute in a single cycle (four 
clock periods or 160 ns) and include post-normalization for each floating-point operation. An instruction may have 
two floating-point operations: a floating-point multiplication and a floating-point addition, yielding two floating-point 
operations per instruction cycle. The DSP32 architecture also introduced several significant architectural 
features, many of which are still unequaled by competing digital signal processors. The DSP32 was the first DSP 
to offer a byte-addressable address space, the first with a unified memory architecture (program and data share 
the same space), the first to offer DMA, and the first DSP architecture to support 4 memory accesses per 
instruction cycle (a feature still exclusive to AT&T DSPs). 


1.2.2 The AT&T DSP32C Digital Signal Processor 


The DSP32C Digital Signal Processor is upward compatible with the DSP32 (object code, source code, and pin 
compatibility). The DSP32C offers higher throughput, lower power dissipation, and lower cost than the DSP32, 
Major enhancements over the DSP32 include a larger address space, higher throughput parallel VO, a fully 
vectored interrupt structure, and many instruction enhancements to better address the needs of DSP systems 
designers. The DSP32C has been very successful in the 3D graphics, speech processing, PC add-on card, and 
telecommunications application areas. The DSP32C is fabricated in 0.75 um CMOS technology and is packaged 
in standard 164-pin and 68-pin (microcomputer version) plastic packages as well as a 133-pin ceramic PGA 
package. i 


The DSP32C's two execution units, the control arithmetic unit (CAU) and the data arithmetic unit (DAU), are used 
to achieve the high throughput of 12.5 million instructions per second. The CAU performs 16- or 24-bit integer 
arithmetic and logic operations, and provides data-move and control capabilities. This unit includes 22 general- 
purpose registers. The DAU performs 32-bit floating-point arithmetic for signal processing functions. It includes a 
32-bit floating-point multiplier, a 40-bit floating-point adder, and four 40-bit accumulators. The multiplier and the 
adder work in parallel to perform 25 million floating-point computations per second. The DAU also incorporates 
special-purpose hardware for data-type conversions. 


On-chip memory includes 1536 words of RAM. Up to 16 Mbytes of external memory can be directly addressed by 
the external memory interface that supports wait states and bus arbitration. All memory can be addressed as 8-, 
16-, or 32-bit data, with the 32-bit data being accessed at the same speed as 8- or 16-bit data. 


The DSP32C has three I/O ports: an external memory port, a serial port, and a 16-bit parallel port. In addition to 
providing access to commercially available memory, the external memory interface can be used for memory- 
mapped I/O. The serial port can interface to a time division multiplexed (TDM) stream, a codec, or another 
DSP32C. The parallel port provides an interface to an external microprocessor. Three on-chip direct-memory- 
access (DMA) controllers support direct memory access via the serial input, serial output, and parallel I/O ports. 
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A single-level interrupt facility can respond to four internal and two external, individually maskable sources. A 
relocatable vector table controls program flow based on the source of the interrupt. 


The DSP32C has two general types of instructions that correspond to the two execution units: data arithmetic 
(DA) instructions and control arithmetic (CA) instructions. Primarily, DA instructions perform 32-bit floating-point 
multiply/accumulate operations for signal processing algorithms. Other DA instructions convert the DSP32C 
internal floating-point data to and from each of the following types: 8-, 16-, or 24-bit 2's complement integer, 8-bit 
p-law, 8-bit A-law, or 32-bit single-precision IEEE standard floating-point. The CA instructions perform 
microprocessor operations such as 16- and 24-bit integer arithmetic and logic functions, conditional branching, 
and moving data. 


In summary, some of the key features of the DSP32C are as follows: 


Full, 32-bit floating-point arithmetic 

Four memory accesses per instruction cycle 

Single, linear address space 

Byte addressable internal and external memory 

Up to 16 Mbytes of external memory 

1536 words of on-chip RAM 

16 Mbits/s serial I/O ports and 16-bit parallel I/O port with DMA options 
8- or 16-bit microprocessor interface requiring no additional logic 
Single-precision, single-cycle, IEEE floating-point conversion capability 
Bit reverse addressing mode 

Four 40-bit accumulators 

Zero-overhead looping 

16- and 24-bit integer operations 

Vectored Internal and external interrupts with hardware context save 
Low-power CMOS technology 


The DSP32C also includes extensive hardware and software support from both AT&T and many third parties. A 
bus functional simulation model is offered by Logic Automation, a complete DSP32C hardware emulator is 
available from Hewlett-Packard (64773). For more information on the DSP32C, refer to the DSP32C Information 
Manual, DSP32C Data Sheet, or any of several software support documentation manuals available from your 
AT&T account manager. 


1.2.3 The AT&T DSP3210 Digital Signal Processor 


The DSP3210 Digital Signal Processor is the newest member of AT&T's Floating-Point DSP Family. The 
DSP3210 is an enhancement of the DSP32C architecture designed specifically to meet the needs of the 
PC/Workstation multimedia market. The DSP3210 is a three-part product composed of the DSP3210 device, the 
VCOS real-time OS, and the VCOS multimedia function library. The DSP3210 is a very low-cost systems solution 
(no local DSP RAM is required for systems solutions) with complete applications support and full, 32-bit floating- 
point processing power, The DSP3210 is source code compatible with the DSP32C via AT&T's DSP32C to 
DSP3210 source code translator. New features added to the DSP3210 are briefly outlined in Table 1-1. 
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Table 1-1 DSP3210 Features 


(aes acta ee Pentute me se | Benefit 
32-bit addressing Compatible with host address space. 
Bi 










Barrel Shifter etter bit manipulation. 
2048 32-bit words of contiguous on-chip RAM RAM easily traded between program and data. 






Quad-word transfer capability Increased system performance. 
Programmable 32-bit timer Reduces system cost. 


routines 


Repeat facility for single instruction do loops Speeds block move operations. 
(no re-fetching of instruction) 


Big or little endian byte ordering Compatible with Motorola and Intel processors. 


Compatibility with Motorola and Intel bus protocols Simplifies hardware design. 
(byte select of size/offset) 













Page break detect signal for DRAMs Improves memory system performance. 
Support for signed characters and unsigned shorts Improved compiler performance. 
Packaged in a low cost 132 PQFP Low cost, small footprint. 


1.3 Support Software Tools 


A variety of tools are available to aid in application program development for the DSP3210. These tools are 
initially offered under the UNIX®, MS-DOS’, or Macintosh I/t operating systems. In addition to 
Assembler/Simulator/Compiler support tools, the DSP3210 includes the AT&T VCOS real-time operating system 
complete with its multimedia function library as well as powerful debugging and testing tools. 


AT&T DSP3210 Support Software Tools 


Software tools used to create, test, and debug DSP3210 application programs at the assembly language level are 
packaged in the DSP3210 Support Software Tools package. An assembler, link editor, simulator, and other 
utilities are included. The simulator, which has capabilities such as single-stepping, breakpointing, and execution 
profiling, performs precise simulations of the device. 


* MS-DOS is a registered trademark of Microsoft. 
t Macintosh Ilis a registered trademark of Apple Inc. 
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AT&T DSP3210 Application Software Library 


The DSP3210 Application Software Library is a collection of commonly-used signal processing and 3D graphics 
functions. Each function is carefully benchmarked to allow applications to easily be built from collections of 
assembly language functions while meeting real-time processing requirements. Many functions are offered in 
several implementations, giving the programmer flexibility in balancing speed and memory requirements . 


AT&T DSP3210 C Language Compiler 


The optimizing C Language compiler for the DSP3210 allows application programs to be written in ageneral, 
high-level language. In applications where preliminary program development is performed using the C-language 
on a minicomputer or supercomputer, the source code can be ported to the DSP3210 with a minimal amount of 
time and effort. A symbolic debugger and several libraries of commonly used arithmetic and signal processing 
functions are included with the compiler. 


1.4 Other Applicable Documentation 


The following documents provide specific information on the DSP3210 and its hardware and software 
development tools. When designing application hardware and software, it is important to have accurate 
information. Contact your local account representative for the latest issue of a document. 


The AT&T DSP3210 Digital Signal Processor Data Sheet contains up-to-date timing requirements and 
specifications, electrical characteristics, and a summary of the instruction set and device architecture. 


The AT&T DSP3210 Support Software Tools User Manual describes how to install and use the DSP3210 support 
software. 


The AT&T DSP3210 Application Software Library Reference Manual contains information on the use and design 
of each assembly-language routine includéd in the DSP3210 Application Software Library. 


The AT&T DSP3210 C-Language Compiler User Manual describes how to set up and use the DSP3210 C 
compiler. 


The AT&T DSP3210 C-Language Compiler Library Reference Manual provides the information necessary to 
implement the C programming language with the DSP3210. 


1.5 Assistance 


Assistance is available throughout the life of the product. These services include: 


= Technical documentation and product samples 
= Information on determining and selecting the appropriate hardware and software 


The AT&T DSP Bulletin Board provides the latest and most up-to-date information about AT&T DSP products and 
application assistance: 


1200/2400 baud 
7 data bits, 1 stop bit, even parity 
215-778-4444 
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For technical assistance or further information including ordering information and part numbers, please contact 
the nearest office through the following phone numbers: 


Domestic (USA & Canada) 


Northeast Region Mid-Atlantic Region Southeast Region 
508-626-2161 215-768-2626 404-446-4700 
North Central Region South Central Region Southwest Region 
612-885-4300 214-869-2040 602-244-1100 
Rocky Mountain Region Pacific Northwest Southern California 
303-850-5415 408-522-5555 714-220-6223 
International 

Europe (except Spain & Portugal) Spain and Portugal 

+49 89 950 86 0 +34 1 404 6012 

Telefax: +49 89 950 86 111 Telefax: +34 1 404 6252 

Japan Pacific Rim 

813-593-3301 65-225-5233 

Telex: J32562 ATTIJ Telex: RS 42898 

Telefax: 813-593-3307 Telefax: 65-225-8725 


Internal (AT&T Customers) 


AT&T internal customers should contact their local AT&T Account Management Office. 
If the Account Management telephone number is not known, call 1-800-372-2447 and ask for the telephone 
number of your account representative. 
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2, APPLICATIONS...... 
2.1 PC/Workstation Multimedia Applications . 


Figure 2-1 Typical PC Multimedia System Configuration 
2.2 3-D Graphics Applications 


Figure 2-2 Typical 3-D Graphics System Configuration. 
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The DSP3210 can be used in many different application areas including: telecommunications, speech processing, 
image processing, graphics, array processors, robotics, studio electronics, instrumentation, and military 


applications. 


ELECTRONIC DATA PROCESSING 


= Mass Memory 
= Workstations 


= Front-End Processor 


INDUSTRIAL 


= Robotics 

= Image Processing 

= Process Control 

= Real-Time Simulators 
= Instrumentation 


TELECOMMUNICATIONS 


PBX 
Switches 
Modems 
Transmission 


GOVERNMENT/MILITARY 


Sonar 

ECM 

Airframe 

Radar Tracking 


SPEECH 


= Recognition 
= Synthesis 
= Coding 


CONSUMER 


= Studio Electronics 
= Entertainment 
= Educational 


Disc controllers, high-precision servo control 

Graphics, translations, rotations, shading, perspective scaling, 
inversion, multiplication, numeric accelerators, array processing 
Bit manipulation, encryption 


High-precision servo control 

Restoration, pattern recognition, compression 
Minicomputer functions 

Graphics, servo control, system modeling 

Oscilloscopes, FFT, spectrum analysis, signal generators 


Tone detection, tone generation, MF, DTMF 

Tone detection, tone generation, line testing 

Echo cancellation, filtering, error correction and detection 
Multi-pulse LPC, ADPCM, transmultiplexing, encryption 


Beam forming, FFT 

FFT, adaptive filtering 
Simulation 

Precision FFT, matrix inversions 


Feature extraction, spectrum analysis, pattern matching 
LPC, format synthesis 
ADPCM, LPC, multi-pulse LPC, vector quantization 


Digital audio 
High-end video (special effects) 


When the software required for a DSP application is being developed, it is often desirable to use some standard 
algorithms already in existence. Each application requires a somewhat different use of these algorithms, yet there 
are great similarities from the processing point of view. Many of the algorithms available are usable in a wide 
variety of physical problems. The driving force behind most of the algorithms has been an attempt to reduce the 
computational and data transfer requirements to accommodate the performance constraints of available 
hardware. Many of these algorithms have been coded in DSP3210 assembly language and are presented in the 
AT&T DSP3210 Application Software Library Reference Manual. 
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2.1 PC/Workstation Multimedia Applications 


The DSP3210 is intended to be used in PC and workstation system architectures in which the DSP3210is a 
parallel processor to a host processor. The DSP3210 maintains a 32-bit bus master interface to system memory 
(see Figure 2-1). The primary benefit of this system architecture is that the DSP is able to access program and 
data from system memory without host intervention, and expensive local SRAM is replaced by general purpose 
DRAM. Since the DSP3210 supports both big- and little-endian byte ordering, sharing both data and pointer 
values with any host microprocessor is easily accomplished. This is especially useful in multimedia applications 
where intimate communications between the host uP and DSP are necessary. For real-time signal processing, 
on-chip SRAM is loaded with code and data from system memory before executing. Typically, applications are 
broken down into functions that are executed successively in this fashion. 


This technique is also used to create low-cost EISA and MCA add-on cards. The DSP3210 acts as a bus master 
on the 32-bit bus to access system memory (rather than adding memory to the DSP3210 card). ISA 16-bit add-on 
cards typically employ local memory since their 16-bit width would present a memory bottleneck to the DSP3210. 
However, the visible caching technique employed by the VCOS operating system permits the use of slow, 
inexpensive DRAM for local memory with minimal impact on system performance. Applications are compatible 
across all implementations described here. 


Low-cost, single-chip 
AID and D/A converter 








On-chip RAM used for kernal storage 
and program/data cache. 
Telephone 


Speaker/ 
Microphone 
ie) 


BUS MASTER 
INTERFACE 


SYSTEM BUS [J~ 


SYSTEM 
MEMORY 


Figure 2-1 Typical PC Multimedia System Configuration 






Digital Audio Data 








Direct interface to system bus 
provides high bandwidth and 


ye low cost. 
System memory provides low-cost 


storage of functions and data. 
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2.2 3-D Graphics Applications 


Although the DSP3210 was primarily designed for personal computer/workstation multimedia applications, it is 
ideally suited to perform the floating-point, compute intensive calculations in a typical 3-D transformation pipeline. 
In fact, while performing graphics operations, the DSP3210 often outperforms far more costly processors which 
are touted for their graphics capabilities. These operations consist of matrix multiplies, divides, square root 
computations, comparisons, and integer conversions. In addition to its native floating-point performance, byte 
addressability, and large address space, the DSP3210 has on-chip hardware to improve the efficiency of many of 
these operations such as: reciprocal seed, 8-, 16-, and 32-bit integer to/from floating-point conversion, clip-test 
register, Z-buffering, etc. A typical graphics system will contain multiple DSP3210 processors that operate in 
parallel to render the image. The low cost of the DSP3210 makes multiple DSP graphics systems practical. Its 
page-break and quarter-cycle wait state features significantly reduce system costs while increasing system 
performance. Inexpensive DRAM can be used for local memory while still achieving the high throughput 
necessary for high-speed 3-D transformations. The DSP3210's flexible bus protocol also makes it possible for 
several DSP3210 devices to share the same command and output buffers with little or no performance 
degradation, further reducing system costs. Figure 2-2 shows a typical 3-D graphics subsystem architecture 
employing a common command buffer/output buffer configuration. 






MULTIPLE DSP3210s 








| 
Y 
s BGACKN,LOCKN 
uy 
HOST E 
CPU M 


cow 





BRN, BGN, 
BGACKN,LOCKN 


MEMORY BRN : 

INTERFACE RAMDAC 
DSP N 

MONITOR 


Figure 2-2 Typical 3-D Graphics System Configuration 
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3. DSP3210 ARCHITECTURE 


The DSP3210 is a 32-bit, floating-point, programmable digital signal processor. It has been designed for use in 
numerically intensive applications requiring fast floating-point arithmetic, efficient compiler-generated code, low 
system cost, and low power dissipation. This has resulted in an architecture with two execution units, a 32-bit 
integer RISC controller and a 32-bit floating-point data processing unit. Also, two 1K x 32 bit parallel, on-chip 
memories and a high-bandwidth bus interface deliver operands and instructions to the execution units supporting 
up to four memory accesses per instruction cycle. On-chip peripherals including a serial interface port, bit /O 
port, and timer reduce overall system cost. Table 3-1 summarizes key attributes of the DSP3210 architecture. 


Table 3-1 DSP3210 Features/Benefits 


Full 32-bit floating-point architecture: Simplifies program development. 
-Increased precision and dynamic range Provides faster time to market. 

Much easier algorithm development. 
Opens up new application possibilities. 


All instructions are single-cycle: Allows more complex functions or a greater 
—No multi-cycle branches number of simultaneous functions to be performed. 
Hardware data format conversions: Eliminates lenghty software routines. 


IEEE P754 Floating-Point Permits shared data with host processor or other 
platforms. 
















































—Integer conversion Increased throughput in: 
8-bit unsigned Graphics and image processing 
16-bit linear . Data communications 
32-bit linear High quality audio and control applications 
j-law/A-law Telecom and speech applications 





context save interrupts/sec). 
Configurable bus interface: Simplifies/reduces cost of memory controller. 
-32-bit addressing Compatible with uprocessor system busses. 
—Support for relinquish/retry/ous error 
—Page break detect 


Byte-addressable address space 


Flexible wait-state facility: Greater memory speed selection flexibility than 
—Each wait-state is 1/4 instruction cycle conventional full cycle wait-states. 

—Two independently configurable external memory | Allows mixing of slow and fast memory. Optimizes 
speed partitions system speed/cost requirements. 
On-chip peripherals: Low cost interface to external devices. 
—Serial 1/O with DMA capability Lowers system cost. 
—32-bit timer 


























Improves performance in designs using DRAM. 


Efficient storage of 8- and 16-bit data. Address pointer 
compatibility with microprocessors. 
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3.1 Functional Units 


The DSP3210 consists of seven functional units: Control Arithmetic Unit (CAU), Data Arithmetic Unit (DAU), On- 
chip Memory (RAMO, RAM1, Boot ROM), Bus Interface, Serial I/O (SIO), DMA Controller (DMAC), and 
Timer/Status/Control (TSC). Figure 3-1 is a block diagram of the DSP3210. 


3.1.1 Control Arithmetic Unit (CAU) 


The CAU is responsible for performing address calculations, branching control, and 16- or 32-bit integer arithmetic 
and logic operations. It is a RISC core consisting of a 32-bit arithmetic logic unit (ALU) that performs the integer 
arithmetic and logical operations, a 32-bit program counter (PC), and twenty-two 32-bit general-purpose registers. 
The CAU can execute 16.7 MIPS (Million Instructions Per Second). 


The CAU performs two tasks: one executes integer, data move, and control instructions (CA instructions), and the 
other generates addresses for the operands of floating-point instructions (DA instructions). CA instructions 
perform load/store, branching control, and 16- and 32-bit integer arithmetic and logical operations. DA 
instructions can have up to four memory accesses per instruction, and the CAU is responsible for generating 
these addresses using the post-modified, register-indirect addressing mode—one address in each of the four 
states of an instruction cycle. For a more detailed description of the CAU, see Section 8.1 Control Arithmetic Unit. 


3.1.2 Data Arithmetic Unit (DAU) 


The DAU is the primary execution unit for signal processing algorithms. This unit contains a 32-bit floating-point 
multiplier, a 40-bit floating-point adder, four 40-bit accumulators, and two control registers (dauc and ctr). The 
multiplier and adder work in parallel to perform 16.7 million computations per second of the form (a=b+c*d). The 
multiplier and adder each produce one result per instruction cycle. The DAU contains a four stage pipeline: 
operand load, multiply, accumulate, and result store. Thus, in any instruction cycle, the DAU may be processing 
four different instructions, each in a different stage of execution. 


The DAU supports two floating-point formats, single precision (32-bit) and extended single precision (40-bit). 
Extended single precision provides 8 additional mantissa guard bits. Post-normalization logic transparently shifts 
binary points and adjusts exponents to prevent inaccurate rounding of bits when the floating-point numbers are 
added or multiplied, eliminating concerns like scaling and quantization error. All normalization is done 
automatically, so the result in the accumulator is fully normalized. 


Single instruction, data type conversions are done in hardware in the DAU, reducing overhead required to do 
these conversions. The DAU performs data type conversions between the DSP3210 32-bit floating-point format 
and IEEE P754 standard 32-bit floating-point, 16- and 32-bit integer, 8-bit unsigned, u-law and A-law formats. 
The DAU also provides an instruction to convert a 32-bit floating-point operand to a 3-bit seed value used for 
reciprocal approximation in division operations. For a more detailed description of the DAU, see Section 8.2 - 
Data Arithmetic Unit. 


3.1.3 On-Chip Memory 
The DSP3210 provides on-chip memory for instructions and data. Instructions and data can arbitrarily reside in 
any location in the on-chip memory. The DSP3210 provides two 1K x 32 RAMs and a 256 x 32 Boot ROM. The 


memories operate in a parallel fashion to achieve high performance. The Boot ROM is preprogrammed with a 
routine that can load the internal RAM of the DSP from an external memory, such as an EPROM. 
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3.1.4 Bus Interface 


The external address bus of the DSP3210 is 32-bits wide and fully byte-addressable, allowing the DSP3210 to 
directly address 4 Gbytes of memory or memory-mapped hardware. External memory is partitioned into two 
logical address spaces A and B. Each partition contains approximately 2 Gbytes of address space. The number 
of wait-states for external memory paritions A and B are independently configurable via the pew register. 
Configured waits of 0-, 1-, 2-, or 3-or-more wait-states are programmable; this simplifies the interface to fast 
external memory. Unlike most digital signal processors (which employ full-cycle wait-states), the DSP3210 offers 
much greater flexibility by offering 1/4 cycle wait-states. Each wait-state is 1/4 of an instruction cycle, allowing 
much greater granularity in determining optimal speed/cost memory trade-offs. When waits are externally 
controlled, the DSP adds wait-states until the memory acknowledges the transaction via the SRDYN pin. The bus 
interface supports retry, relinquish/retry, and bus error exception handling. All signalling provided to the external 
system is configurable on reset to simplify the interface to a variety of microprocessor system buses. 


Sharing the external memory interface is performed via a complete request/acknowledge protocol. System 
throughput is greatly enhanced by the DSP3210's ability to execute from internal memory while the DSP3210 
does not have ownership of the bus. The DSP3210 continues to execute from internal memory until accesses to 
the external memory are needed. At that point, the DSP asserts the BRN signal and wiats for BGN. The bus 
arbiter acknowledges the bus request by asserting the DSP3210's bus grant pin, BGN. The DSP3210 then 
acknowledges the grant by asserting the bus grant acknowledge, BGACKN, and drives the external memory 
interface pins. When BGN is negated, any on-going external memory transaction is completed before the DSP. 
relinquishes the bus by placing the external memory interface bus in the high-impedance state and negating 
BGACKN. See Section 6 - Bus Interface. 


The DSP3210 shares the external memory interface bus through a request/acknowledge protocol. The external 
memory interface bus consists of the address bus, A2—A31, the data bus, D0—D31, and associated control pins 
(See Section 6 - Bus Interface). The DSP is a default bus slave. When the DSP does not have ownership of the 
bus, it executes from internal memory until accesses to the external memory are needed, and at that point the 
DSP asserts the BRN and waits for BGN. The bu8 arbiter acknowledges the bus request by asserting the DSP's 
bus grant pin, BGN. The DSP acknowledges the grant by asserting the bus grant acknowledge, BGACKN and 
driving the external memory interface pins. When the BGN is negated, any on-going external memory transaction 
is completed before the DSP relinquishes the bus by placing the external memory interface bus in the high- 
impedance state and negating BGACKN. See Section 6 Bus Interface. 


3.1.5 Serial /O (SIO) 


The SIO unit provides serial communications and synchronization with external devices. The external signals 
provided support a direct interface to a time-division-multiplexed (TDM) line, a zero-chip interface to codecs, and 
direct DSP-to-DSP transfers for multiprocessor applications. The SIO performs serial-to-parallel conversion of 
input data and parallel-to-serial conversion of output data, at a maximum rate of 25 Mbits/s. It is composed of a 
serial input port, a serial output port, and on-chip clock generators. Both ports are double buffered so that back- 
to-back transfers are possible. The SIO is configurable via the ioc register. The input buffer, ibuf, the output 
buffer, obuf, and the ioc register are accessible as MMIO (Memory-mapped Input/Output) registers in the 
instruction set. 


The data sizes of the serial input and output can be selected independently. Input data lengths of 8, 16, and 32 
bits, and output data lengths of 8, 16, 24, and 32 bits can be selected. The input and output data may be selected 
to be most significant bit first or least significant bit first independently. 


SIO transfers can be made under program, interrupt, or DMA control. A program may test the input or output 
buffer status flags using conditional branch instructions. By configuring the exception mask register, emr, 
interrupt requests may be generated by the input and output buffer status flags. In DMA mode, transfers occur 
between ibuf, obuf, and memory without program intervention. 


For a detailed description, see Section 9.1 Serial I/O. 
3.1.6 DMA Controller (DMAC) 


The DMA controller contains two DMA channels, one for input DMA and one for output DMA , that are used in 
conjunction with the serial /O, By configuring the input DMA channel, data being shifted into the serial input port 
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can be buffered in memory without processor intervention. By configuring the output DMA channel, a buffer of 
data in memory can be supplied to the serial output, as necessary, without processor intervention. The registers 
to configure the DMA Controller are accessible as MMIO registers in the instruction set. By configuring the 
exception mask register, emr, interrupt requests may be generated when the memory buffer has been filled or 
emptied based on the size of the buffer requested. 


For a detailed description see Section 9.2 DMA Controller. 
3.1.7 Timer 


The timer is a programmable 32-bit interval timer/counter that can be used for interval timing, rate generation, 
event counting, or waveform generation. The input to the timer can be derived from the DSP3210 clock, or it may 
come from an external source. The output of the timer can generate a maskable interrupt or be selected as an 
output of the chip to drive external hardware. The count-down timer can be configured to count to zero once, or to 
count continuously by automatically reloading the counter with its initial value when it reaches zero. The count 
value may be read or changed at any time during operation. The registers associated with the timer are 
accessible as MMIO registers in the instruction set. By configuring the exception mask register, emr, interrupt 
requests may be generated when the count reaches zero. 


For a detailed description, see Section 9.3 Timer. 
3.1.8 Bit 1/0 (BIO) 


The BIO is a general purpose 8-bit input/output port. It includes features that make it suitable for board-level 
status signal generation and control signal testing by the DSP3210. The BIO interface consists of 8 I/O lines, 
which can be independently configured as an input or an output. Outputs can be written with a 1 or a0, toggled, 
or left unchanged. Inputs can be directly read and loaded into a CAU register and then tested. The registers 
associated with the BIO are accessible as MMIO registers in the instruction set. Four of the BIO ports are used 
on reset to configure the memory map and bus interface. After reset, they can be used arbitrarily. 


For a detailed description, see Section 9.4 Bit /O. 


3.2 Processor Control Features 


The DSP3210 supports advanced control features that simplify system design and improve software performance. 
This section overviews serial I/O direct-memory access (DMA), exceptions, and the powerdown mode. A more 
detailed description of these control functions is presented in Section 7 Processor States. 


3.2.1 Serial /O Direct Memory Access (DMA) 


External devices can access the on-chip RAM in the DSP3210, as well as external memory, using DMA. DMA 
transfers occur between the memory and the serial I/O ports without processor intervention, using cycle-stealing. 
Two on-chip DMA controllers support memory access via the serial input and serial output ports. See Section 7.1 
Serial /O and Section 7.2 DMA Controller. 


3.2.2 Exception Processing 


Normal instruction processing can be altered by the introduction of interrupt routines or error handling routines. 
Exception processing is the set of activities performed by the processor in preparing to execute a handler routine 
or in returning to the program that took the exception. Error exception and interrupt exceptions cause different 
activities to be performed. In particular, error exceptions abort the current instruction and can not resume 
processing. Interrupt exceptions shadow the current state of the processor before taking the interrupt exception; 
therefore, when the interrupt routine is complete the program can be reinstated and continued. 


The error and interrupt exceptions are prioritized and some error sources in addition to all interrupt sources are 
individually maskable via the exception mask register, emr. A relocatable vector table controls program flow 
based on the source of the interrupt exception. In response to a given exception, the DSP branches to the 
corresponding address in the exception vector table which contains pairs of 32-bit words. 
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The DSP3210 provides a zero-overhead context save of the entire DAU (floating-point unit) for interrupt 
processing. Interrupt latency is only three instruction cycles for interrupt entry and one instruction cycle for an 
interrupt exit. Before servicing the interrupt, the DSP3210 automatically saves the state of the machine that is 
invisible to the programmer, as well as the DAU accumulators a0—a3 (including guards bits), all DAU flag status 
information, and the dauc register. Internal states that are visible to the programmer are saved and restored by 
the interrupt service routine. To return to the interrupted program, the interrupt service routine restores the user- 
visible state of the DSP3210 (that was saved) and then executes the ireturn instruction. This single-cycle 
interrupt return automatically restores the entire DAU state saved during interrupt entry. Quick interrupt 
entry/exit/context save is critical to real-time multimedia tasks. 


For a detailed description of exception processing, see Section 7.4 Instruction Sequencing Exceptions. 
3.2.3 Wait-for-Interrupt 


The DSP3210 is equipped with a wait-for-interrupt instruction that stops internal execution in the DAU and CAU. 
External memory and internal memory operations execute to completion and then wait. Bus arbitration logic 
remains active in this mode. The peripheral units (Serial /O, DMA controller, Timer, and BIO) remain active 
during wait-for-interrupt to perform /O events and the interrupt handler is active to sense an interrupt request. 
The DSP will exit the wait-for-interrupt mode when either an unmasked interrupt is requested or an error 
exception occurs. 


3.3 Data Types 


Internally, the DSP3210 fully supports three data type formats: 16- and 32-bit 2's complement integers and 32-bit, - 
floating-point numbers. The control arithmetic unit (CAU) performs arithmetic and logical operations on the 16- 

and 32-bit integer formats. It also has the ability to load and store 8-bit signed and unsigned, and 16-bit unsigned 

data. The data arithmetic unit (DAU) performs multiply and accumulate operations on the 32-bit, floating-point 

data and local to the DAU supports a 40-bit extended precision format. The DAU also performs data type 

conversion between IEEE P754 single precision, 8-bit unsigned, j1-law and A-law, and 16- and 32-bit 2's 

complement. 


3.3.1 32-bit 2's Complement Data Type 


The CAU supports a 32-bit, 2's complement integer data type that can be used as 32-bit integer data or for 
memory pointers. The 32-bit integers range in value from -231 to 231 — 4. Thirty-two bit arithmetic operations are 
the default type. All pointers must be 32-bit data types. The organization of 32-bit data in CAU registers is: 


31 0 
32-BIT INTEGER 
MSB LSB 
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3.3.2 16-bit 2's Complement Data Type 


The CAU supports a 16-bit, 2's complement integer data type that is used as 16-bit integer data. The 1 6-bit 
integers range in value from -2'5 to 2'5- 1. Because CAU registers are 32-bits wide, when a 16-bit integer 
operation is performed, data should be loaded into the lower 16-bits of the register (with a sign-extend of bit 15 
into the upper 16 bits). A 16-bit operation performs 32-bit arithmetic with flags computed according to 16-bit 
operation flag rules (see Section 4.1 Flags). If the results are stored back to memory, the lower 16-bits of the 
result should be stored into a 16-bit memory location. The organization of 16-bit data in CAU registers is: 


SIGN-EXTENDED 16-BIT INTEGER 





3.3.3 32-bit Floating-Point Data Type 


There are two floating-point formats used inside the DAU: a 32-bit format and a 40-bit format (32-bits, plus 8 
mantissa guard bits). The latter format, used only inside the DAU, is not supported by the rest of the device. The 
DAU accumulators are 40-bits. The guard bits are not user-accessible. For more information about the 40-bit 
format, see Section 8.2 - Data Aritmetic Unit. The organization of floating-point data in DAU accumulators 
(a0—a3) is: 





f 
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The following is a description of the 32-bit floating-point format. 
Format 


The DSP3210 internal 32-bit floating-point format consists of a 24-bit, normalized, 2's complement mantissa and 
an 8-bit biased exponent. Figure 3-2 shows the DSP3210 32-bit floating-point format. 





MSB 


Figure 3-2 DSP3210 Internal 32-bit Floating-Point Format 


where: s = the sign bit (0 for positive, 1 for negative) 
f = fractional part of the mantissa 
s,f = the mantissa 
e = the exponent 
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This floating-point quantity is represented in decimal as follows: 
N=M*2(e-128) 


where: N = the decimal value 
e = the exponent, the unsigned quantity “eeeeeeee” expressed in base 10. 
M = mantissa, the 2's complement quantity “s(!s).ffff...ffff is equivalent to (-2)S + 
O.ffff...ffff expressed in base 10. 
* = multiplication si 


The magnitude of the mantissa is always normalized to lie between 1 and 2. So, the leading 1 (or leading 0 ifa 
negative mantissa) does not appear explicitly in the floating-point word. This implicit bit is the s bar 
(!s ) that appears above. - 


Zero 


A floating-point value with e=0 is reserved to represent the number zero. Here, the sign and mantissa bits must 
also be zero. A zero exponent with a nonzero sign and/or mantissa is called a dirty zero, but is treated the same 
as zero. If adirty zero is an operand, it is treated as a zero. Any dirty zero generated in the floating-point 
multiplier or adder is flushed to a valid zero before being written to the accumulators or memory. 


Range 


The 32-bit floating-point format used in the DSP3210 provides over 1500 dB of dynamic range. The range of 
nonzero positive floating-point numbers is: 


N = [1°2-127, [2-223] + 2127) inclusive. 
The range of nonzero negative floating-point numbers is: 

N = [-[142°23] * 2-127, -2 * 2127) inclusive. 
The ranges of positive and negative nonzero numbers are almost identical, and the corresponding range for the 
magnitude of a floating-point number N, in decimal, is approximately: 

IN| = [5.87747E-39, 3.40282E38] 


3.4 Memory Organization 


The memory space is byte addressable and can support both little endian and big endian byte orderings. The 

organization of bytes in a 32-bit word is based on the state of the B/LN bit in the pew register. If B/LN = 0, the a 
little endian mode is selected and byte 0 is the least significant byte of the 32-bit word. If B/LN = 1, the big endian 
mode is selected and byte 0 is the most significant byte of the 32-bit word. 8-bit data is referred to as a byte, 
16-bit data is referred to as a half-word, and 32-bit data is referred to as a word. Each 32-bit word is organized as 
four bytes, two half-words, or one word. Data must be aligned on the same byte boundary as its operand size. 
Bytes can be arbitrarily aligned, half words must be aligned on bytes 0 or 2, and words must be aligned on byte 0. 
Figure 3-3 illustrates the addressing of data for both byte orderings. 
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Note: $ indicates address 














Big Endian (B/LN=1) 








Little Endian (B/LN=0) 


31 23 15 7 0 31 23 15 re 0 
00 
$00000000 $00000000 
Half-word Half-word Half-word 
$00000000 $00000002 $00000002 $00000000 
$00000000 | $00000001 | $00000002 | $00000003 $00000003 | $00000002 | $00000001 | $00000000 


Word 
$00000004 


Word 
$00000004 


Half-word Half-word Half-word 
$00000004 $00000006 $00000006 $00000004 
$00000004 | $00000005 | $00000006 | $00000007 $00000007 } $00000006 | $00000005 | $00000004 


Figure 3-3 Memory Addressing 


etc. 


The organization of memory in the DSP3210 address space is shown in Figure 3-4. One of the two memory 
maps is selectable with the C/PN bit of the pew register. The memory map is divided into seven major sections: 
Boot ROM, MMIO (Memory-mapped Input/Output), on-chip RAMO, on-chip RAM1, Reserved (for future expansion 
of ROM and RAM), External Memory A and External Memory B. On-chip SRAM resources consist of two 1K x 32 
banks. The Boot ROM consists of 256 words. The MMIO reserves 256 words. The reserved expansion region 
provides for future expansion of up to 15.5K words of on-chip memory. External Memory A and External Memory 
B are logical partitions in which a different number of wait-states may be configured for each. The number of wait 
states is configured in the pew register. See Section 6 Bus Interface. 


The first address issued on reset is location 0x0 and is located in Boot ROM (if C/PN is initialized to a 1 on reset 
by BIO7) or external memory (if C/PN is initialized to a 0). The generic code provided in the ROM will allow 
redirection of the starting address to locations in external memory A or B. Also, the capability to load internal 
RAM from an external memory device, such as EPROM is provided. This simplifies the use of the DSP3210 in 
embedded applications. The ROM code is described in Appendix A. 
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The MMIO memory space reserves 256 words for accessing on-chip memory-mapped IO registers. The 
DSP3210 contains three peripheral units: SIO, TSC, and DMAC. The control and data registers for these 
peripherals are addressable in the memory space of the DSP3210 as shown in Figure 3-5 IO Memory Map. Each 
register occupies one word memory location. For registers smaller than 32-bits, Figure 3-5 shows the location of 
valid data in the register. MMIO registers are a fixed size and must be read and written using the corresponding 
size indicator. The address of the MMIO register is dependent on the byte ordering. 


Note: Some IO registers have reserved bits for future definition. Those particular bits must be written as zeros for 
future compatibility. 







Reset Address Reset Address 0x00000000 













0x00000000 External 

oxoo000sFF | =BootROM Memory A 
0x5002FFFF 

0x00000400 {0x50030000 

0x000007FF 0x500303FF__| Boot ROM 


0x00000800 
Ox0000DFFF 
0x0000E000 
Ox0000EFFF 


0x50030400 


0x500307FF } mio 
0x50030800 
0x5003DFFF 















(0x0000F000 (0x5003E000 
oxooooFFFF | RAMO 0x5003EFFF 
(0x00070000 
External 0x5003F0000 
Memory A OxSFFFFFFF 
OxSFFFFFFF 







0x60000000 


0x50040000 External 


ll 
External Memory.A 


Memory B OxSFFFFFFF 





OxFFFFFFFF 
0x60000000 
C/PN=1 
OxFFFFFFFF 
C/PN =0 


Figure 3-4 Memory Map 
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16 LSBs 16 LSBs 
of Address | of Address 
(Big Endian) | (Little Endian) 


Reserved 
uf 
obut 
| oxoate | oxo4ic_| 
iont 
a= 
0x0434 
ox07FC 


E)_ Reserved Bits 


Symbolic 
Name 





31 23 















timer 






0x07FC 
RW : ReadMWrite 


Figure 3-5 |O Memory Map 


3.5 Addressing Modes 





Register Bits 


15 


D 
o 
o 
® 
< 
o 
a 


W : Write Only 
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RW 
RW 
RW 
RW 
RW 
RW 


RW 
RW 
Rw 


The DSP3210 supports several data access or addressing modes including immediate, memory-direct, register- 
direct, register-indirect, and register-indirect with postmodification. The addressing modes allowed for each type 
of instruction are shown in Table 3-2. CA data move instructions, which are used to load CAU registers, support 
the greatest number of addressing modes and therefore have the most flexibility. The remainder of the instruction 
types support fewer addressing modes are more tailored to the intended use of the instruction. See Section 4 
Instruction Set for more information about each instruction. This section describes the DSP3210 addressing 


modes. 
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Table 3-2 Addressing Modes Allowed in Each Instruction Type 

















Instruction Type 


CA Data CA Data ‘A Arithmeti D. ‘A and 
Move Group | Move Group | Logic Group | Special Func. 
(CAU Reg) (VO Reg) 
S 





Addressing Mode 








Short Immediate 
24-bit Immediate 
Memory Direct 







a a a Pao ae 
ee. ae ee SS eS eS) 
a Ss a cy 
CAU Register Direct | Yes] ves | ves |_| 
HO Register Direct [Yes |__| oa | 
SaaS 


Register Indirect 
Register Indirect with 
Postmodification 


3.5.1 Short Immediate 


DAU Register Direct = ate] 





Notation: N (16-bit) 


In the short immediate addressing mode, the value N, a 16-bit 2's complement integer, is supplied by the 
instruction, and N is MSB-extended to 32-bits. 


3.5.2 24-bit Immediate 
Notation: MM (24-bit) » 


In the 24-bit immediate addressing mode, the value M, a 24-bit unsigned integer, is supplied by the instruction, 
and M is zero-extended to 32-bits. 


3.5.3 CAU Register (rN) Direct 

Notation: tN (N=0-22) 

In the CAU register direct mode, the operand is a CAU register specified by the instruction. 

3.5.4 10 Register (iorN) Direct 

Notation: iorN (N=1-15) 

In the /O register direct mode, the operand is an IO register specified by the instruction. 

3.5.5 DAU Register (aN) Direct 

Notation: aN (N=0-3) 

In the DAU register direct mode, the operand is a DAU register specified by the instruction. 

3.5.6 Memory Direct 

Notation: *L (16-bit) 

In the memory direct addressing mode, the value L, a 16-bit unsigned integer, is supplied by the instruction. The 
effective address is formed such that it always points to the 64K bytes of on-chip memory space. Therefore, 


when C/PN =1, the upper sixteen bits of the effective address are 0x0000, and when C/PN=1, the upper 16 bits of 
the effective address are 0x5003. 
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3.5.7 Register Indirect 


Notation: *rP (P=0-22 for CA instructions, 
P=1-14 for DA instructions) 


In the register indirect addressing mode, the operand is in memory and the address of the operand is in the 
register specified by the instruction. This register is referred to as a pointer register. The set of registers that can 
be used for this addressing mode is dependent on the instruction type. For all CA Data Move instructions, CAU 
registers ro—r22 can be used as pointer registers. For all DA instructions, CAU registers ri—r14 can be used as 
pointer registers. 


3.5.8 Register Indirect with Postmodification 


Notation: “1P++ post-increment 
“tP-- post-decrement 
*rP++rl post-modify by value in register rl 


(P=0—22 and l=0—22 for CA instructions, 
P=1—14 and l=15—19 for DA instructions) 


In the register indirect addressing mode with postmodification, the operand is in memory and the address of the 
operand is in the register specified by the instruction, rP. This register is referred to as a pointer register. After 
the operand address is used, it is modified by +1, +2, +4, or the value in register rl. rl is referred to as an 
increment register. The post-increment mode modifies the pointer register by +1, +2, or +4 depending on the 
operand size: byte, half-word, or word. The post-decrement mode modifies the pointer register by —1, -2, or —4 
depending on the operand size: byte, half-word, or word. The post-modify by register mode modifies the pointer 
register by the value in the increment register rl. rl is a 2's complement 32-bit integer. The set of registers, 
pointers, and increments that can be used for this addressing mode is dependent on the instruction type. For all 
CA Data Move instructions, CAU registers r0—r22 can be used as pointer registers or increment registers. For all 
DA instructions, CAU registers ri—r14 can’be used as pointer registers, and CAU registers r15—19 can be used 
as increment registers. If the pointer register is the stack pointer, the operand size is always long word and the 
post-increment and post-decrement is always by 4 to keep the stack pointer aligned to a word boundary. 


3.5.9 Other Addressing Modes 
Additional addressing modes, including 32-bit immediate, indexed, relative, memory indirect, bit-reversed, and 
modulo modes can easily be derived by combining two or more CAU instructions as follows. 


—The 32-bit immediate addressing mode is achieved by using the two instruction sequences: 
rD=N 
1D =rS <<|N. 


—The indexed or base plus displacement mode is achieved by using the instruction rD = rS + N and then 
performing register indirect addressing using rD. 


—The relative modes are achieved by using the program counter in the instruction rD = pe + rS and then 
performing register-indirect addressing using rD. rS contains the offset from the pc. 


—The memory indirect mode is achieved using the instruction rD = MEM and then performing register indirect 
addressing using rD. 


—The bit-reversed addressing mode is achieved by using the carry-reverse add function, rD = rD #rS, and then 
performing register indirect addressing using rD. 
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—The modulo addressing mode is achieved by using a two instruction sequence following a postmodification by 
an increment value. Addresses for the beginning and end locations of the circular buffer are placed in registers. 


With: 


Perform postmodification on r3 as usual: 
Two instruction cycles for modulus operation: 


3.6 10 Registers 


r=start_address 
nd_address 
3=current pointer 
r15=increment (step) value 
r4=temporary register 





a0=*r3+4r15 
r4=13-12 
if(pl) r=rt+r4 


The IO register set includes control registers and data registers. IO registers vary in size according to their 
function. IO registers can be 8-bit, 16-bit, or 32-bit registers. The organization of data in the three sizes of 


registers is: 





‘Reserved 


In general, IO registers are a fixed size and must be read and written using the corresponding size indicator. The 
IO register set consists of: 













iord 
ior8 
ior12 
iorl4 
ior15 


Note: Some IO registers have reserved bits for future definition. Those particular bits must be written as zeros for 


future compatibility. 
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4. INSTRUCTION SET 


The DSP3210 has two general types of instructions that correspond to the two execution units: data arithmetic 
(DA) instructions and control arithmetic (CA) instructions. Primarily, DA instructions perform 32-bit floating-point 
multiply/accumulate operations for signal processing algorithms. Other DA instructions convert the DSP's internal 
floating-point data to and from each of the following types: 8-, 16-, or 32-bit integer, -law, A-law, or single- 
precision IEEE floating-point format. The CA instructions perform microprocessor operations such as 16- and 32- 
bit integer arithmetic and logic functions, conditional branching, and moving data. Instruction bit encodings are 
contained in Section 10 - Instruction and Register Set Encodings. 


Figure 4-1 shows the register programming model of the DSP3210. It consists of twenty-two 32-bit general 
purpose registers, one register hardwired to zero, a 32-bit program counter, four 40-bit floating-point accumulator 
registers, and five IO control registers. These registers are summarized below: 


General-purpose CAU 32-bit registers, r1—r22 

CAU register hard-wired to zero, r0 

32-bit program counter, pc, and associated interrupt shadow register, pcsh 

40-bit floating-point accumulator registers, a2—a3, and associated interrupt shadow registers 
Control registers 

—Processor status register, ps, and associated shadow register 

—Exception mask register, emr 

—Processor control word, pcw 

-DAU control register, dauc, and associated shadow register 

—Clip-test register, ctr, and associated shadow register 


Although ri—r22 can be used for general purposes, under certain circumstances these registers are designated 
to be used in specific ways: 


DA instruction memory reference (X, Y, Z fields) pointer registers. 
DA instruction memory reference (X, Y, Z fields) increment registers. 


Used by error exception facility to store the old pc. 
Stack pointer. 
Used as the pointer to the exception vector table. 
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CAU REGISTERS 








DAU REGISTERS 
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Figure 4-1 Register Programming Model 
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4.1 Flags 


The DSP3210 has internal flags that are affected by the results of certain DA, or CA instructions and certain /O 
events. These flags may be tested by conditional instructions and may also be read from the ps register. Table 
4-1 lists the flags and their meaning. 

DSP3210 instructions and the flags affected by each instruction are specified in sections 4.2 DA Instructions and 
4.3 CA Instructions. A zero (0) shown in place of a flag means that the flag is always made zero; a dash (-) in 
place of a flag means that the flag is unaffected by the instruction. 


Table 4-1 DSP3210 Flags 


[False [ Meaning(Flag=1) [Definition 
age Result is negative sign bit = one 
Result is zero mantissa = zero 
Result overflowed |DAU result | > 3.40282E38 
Result underflowed |DAU result | < 5.87747E-39 


ae 
(16-bit) n=b31 (bit 31 of ALU result) 
(16-bit) n=b15 (bit 15 of Barrel Shifter result) 
(32-bit) n=b31 
Result is zero. (16-bit) z=b31 + b30 + ...b1 + bO (+=OR) 


(32-bit) z=b31 +630 + ...b1 + b0 

Carry or borrow out of MSB_ | (16-bit) c=b15¢c (carry out of ALU bit 15) 
(32-bit) c=b31¢ 

Result overflowed (16-bit) v=b14c%b15c (*=exclusive OR) 
(32-bit) v=b30c*b31¢ 


Serial input buffer full IBF =1 

Serial output buffer full | OBE =0 

SY (/O sync) set SY=1 

SIO frame boundary First OBE after SY high-to-low transition 
Interrupt 0 set IRON = 1 

Interrupt 1 set IRiN=1 
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4.2 DA Instructions 


The data arithmetic (DA) instructions are divided into two functional groups: multiply/accumulate and special 
funtions. 


4.2.1 DA Instructions - Multiply/Accumulate 


DA multiply/accumulate instructions perform 32-bit floating-point multiply/accumulate operations for signal 
processing algorithms. The DA multiply/accumulate instructions are listed in Table 4-2. Replacements for the 
operands are listed in Table 4-3. 


Note: { } and [] are not part of the instruction syntax. Parentheses ( ) are part of the syntax and must appear 
where shown in an instruction. Lower-case letters are part of the syntax and upper-case letters are replaced by 
immediate data or by a register name (see tables following each instruction group). A detailed description of each 
instruction is available in Section 4.6 under the Instruction Reference name. 


Table 4-2 DA Multiply/Accumulate Instructions 


DAU Instr. 
Reference 


Flags 
[Z=] aN = [-]aM {+, - FMULT-ACC-STORE | The product of the X and Y fields is 
}Y*X 






Description 









added/subtracted to/from the accumulator aM and 
the result is stored in accumulator aN. The result 
can also be output according to the Z field. 

The Y field operand is output according to the Z 
field. The product of the X and Y fields is added to 
accumulator aM and the sum is stored in 
accumulator aN. 

The product of the X field and accumulator aM is 
added/subtracted to/from the Y field. The result is 
placed in accumulator aN and can also be output 
according to the Z field. 

The product of the X and Y fields is 
added/subtracted to/from zero. The result is 
stored in accumulator aN and can also be output 
according to the Z field. 

The value of the Y field is output according to the Z 
field. The product of the X and Y fields is stored in 
accumulator aN. 

The sum or difference of the Y and X fields is 
stored in accumulator aN and the result can also 
be output according to the Z field. 

The value of the Y field is placed in accumulator 
aN and can also be output according to the Z field. 
The sum or difference of the X and Y fields is 
stored in accumulator aN, and Y is output 
according to the Z field. 











aN = [-JaM {+, - FMULT-ACG-TAP 


\(Z=Y)"x 













FMULT-ADD-STORE 











[Z=] aN = JY {+,— 
jaM"x 














[Z=] aN = [-]Y *X FMULT-STORE 












aN = [-] (2=Y)"X FMULT-TAP 














[Z=] aN = [-Y {+,-}X FMULT-STORE 




















[Z=] aN = [-]Y FLOAD-STORE 











aN =] (Z=Y) {+,-} X FADD-TAP 
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Table 4-3 Replacement for DA Multiply/Accumulate Instructions 


One of the four DAU accumulators 


*rP, *rP++, *rP--, *rP+-+rl, 32-bit memory location, or 
a0-a3 one of the four DAU accumulators 
“rP, *rP++, *rP--, *rP++rl 32-bit memory location 


* rP refers to ri—r14 and is used as a memory pointer. rl refers to r15—r19 and is used as an increment register. 





4.2.2 DA Instructions - Special Function 


DA special function instructions perform data type format conversions, rounding, conditional accumulator loads, 
and compute a reciprocal seed. The DA special function instructions are listed in Table 4-4. Replacements for 


the operands are listed in Table 4-5. 
Description 


Input conversion j1-law, A-law, 8-bit linear to float. 
Output conversion float to j1-law, A-law, 8-bit linear. 


Table 4—4 DA Special Function Instructions 
Instruction DAU Instr. 
Flags | Reference 
IC 
oc 


[Z=] aN = ic(Y) 

[Z=] aN = oc(Y) 

[Z=] aN = float16(Y) FLOAT16 | 16-bit integer to float. 

[Z=] aN = float32(Y) FLOAT32 | 32-bit integer to float. 

[Z=] aN = inti6(Y) INT16 Float to 16-bit integer (round or truncate, dauc[4}). 
[Z=] aN = int32(Y) INT32 Float to 32-bit integer (round or truncate, dauc[4)). 
[Z=] aN = round(Y) ROUND | Round to nearest, float(40) to float(32). 

[Z=] aN = ifalt(Y) IFALT Conditional assignment/memory write. 

[Z=] aN = ifaeq(Y) IFAEQ Conditional assignment/memory write. 

[Z=] aN = ifagt(Y) IFAGT | Conditional assignment/memory write. 

[Z=] aN = dsp(Y) DSP. IEEE to DSP format conversion. 

[Z=] aN = ieee(Y) IEEE DSP to IEEE format conversion. 

aN = seed(Y) SEED 32-bit to 32-bit reciprocal seed. 























































One of the four DAU accumulators 
8-, 16-, or 32-bit memory location, or 
one of the four DAU accumulators 

8-, 16-, or 32-bit memory location 








a0-a3 
“rP, *rP++, *rP--, “rP++rl, 
a0-a3 

“cP, *rP++, “rP--, *rP++r1 


* rP refers to r1—114 and is used as a memory pointer. rl refers to r15—r19 and is used as an increment register. 
t Y may not be a0—a3 for the dsp special function. 
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4.3 CA Instructions 


The control arithmetic (CA) instructions perform 16- and 32-bit integer operations. These operations include 
generating addresses, performing integer data and logical functions, instruction flow control, and data movement. 
The instructions are grouped into three major headings: control, arithmetic and logic, and data move. 


4.3.1 CA Instructions - Control 


The CA control instructions alter the ordinary flow of control from one instruction to the next. They include 
branches, loop counter test, subroutine call and return, return from interrupt, do loops, and wait for interrupt. 
Instructions can branch based on CA, DA, or I/O conditions. The condition for the branch is determined by the 
state of the flags resulting from the most recently calculated CA or DA instruction or /O event. The CA control 
instructions have no effect oni the flags. The CA control instructions are listed in Table 4-6. Replacements for the 
Table 4-6 CA Control Instructions 


COND field are listed in Table 4-7. 
Instr. Description 
Reference 


if (COND) goto {N, rB, rB+N}" GOTO-COND | Conditional branch based on flags 
if (rM-->=0) goto {N, rB, rB+N}" GOTO-LOOP | Conditional branch using loop counter 








































goto {N, rB, rB+N, M, rB+M}" GOTO Unconditional branch 
nop NOP No operation 

call {N, rB, rB+N, M} (rM)" CALL Call subroutine 

return (rM) RETURN Return from subroutine 





Do next K+1 instruction(s) L+1 (or rM+1) time(s). K=0, 1, 
2, ...127; LerM=0, 1, 2, ...2047 
dolock signals interlocked bus accesses. 


do K, {L, rM} DO 




















dolock K, {L, rM} DO 


doblock {L, rM} DO doblock signals quad-word transfers. 
ireturn IRETURN | Return from interrupt 
sftrst SFTRST Soft Reset; Changes error level to base level; encoded as 





spc=(byte)rO 
Wait for interrupt; encoded as spc=(long)r0 









WAITI 


* When using pe as rB, the conditional or unconditional goto and call instructions may be written with the syntax pegoto label or pccall label, 
respectively. The linker will compute the appropriate N (offset) value to encode. 


waiti 


where: 
rB = pc, r0-r22 
tM =11-r22 





N = 16-bit signed integer 
M= 24-bit unsigned integer 
COND = one of the DSP3210 Condition Codes (See Tables below) 
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Table 4-7 Replacement for COND 


(COND) - CAU Conditions 


Not applicable | Condition is always true. 
n=0 Result is nonnegative (plus) 
Result is negative (minus) 
= 
Result equal to zero 
‘ 
vs Overflow set, overflowed 














Carry clear, no carry 
Carry set, carry 









n4v=0 Greater than or equal to zero 
nv=1 Less than zero 

Z| (nv) Greater than zero 

Z| (nv) Less than or equal to zero 


i 


Is Less than (unsigned number) 
Condition Flag Meaning 


Z=0 Not aqual to zero 
Equal to zero 


Z=1 
Greater than or equal to zero 
eee ee 
Overflow set, overflowed 
U Underflow clear, no underflow 
[ust [now setingtoed” 
agt N| Greater than zero 
| Condition [Flag | Meaning 
ibf=0 Input buffer empty 
obe obe: Output buffer empty 
syc sy=0 Sync signal low 
sy=1 Syne signal high 
fo=0 Serial frame boundary clear 
Pe | et |somvancacunaayser 
irOs IRON pin is logic one (1). 
IRON pin is logic zero (0). 
IRIN pin is logic one (1). 
irte 


IR1N pin is logic zero (0). 
* Symbol interpretation: * = XOR; |= OR 































Instruction Set 
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4.3.2 CA Instructions - Arithmetic/Logic 


The CA arithmetic/logic instructions perform integer arithmetic and logical operations in the CAU. Instructions of 
this type do not have memory references. The operands for these instructions are contained in the CAU registers 
or supplied in the instruction as immediate data. Operands that are in memory must first be moved to the CAU 
register array using a data move instruction (see 4.3.3 CA Instructions - Data Move). The size of the operation 
can be specified to be long (32-bits), or short (16-bits). Long is the default and does not have to be specified. 


Table 4-8 lists the CA Arithmetic/Logic instructions. 


Symbols in Table 4-8 can be replaced with: 
rD = r0—r22 
D1 = r0—r20, r22 
rS, $1, rS2 = r0—122 
$3 = r0—r22, pc, pesh 
N = 16-bit signed integer 
COND = one of the DSP3210 Condition Codes (see Table 4-7 Replacement for COND) 
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Table 4-8 CA Arithmetic/Logic Instructions 









CAU Instr. Description 
Re a ee 
PO tees crass i> ade eaten n=l oneness mel es 
immediate 

[Da tongS<<IN | na00_|  SHIFT-OR | Shit 16-bitimmediate loft 16 places and OR _| 
[rD= (Wong), (shony}D-N| nave | SUBTRACT [Right subiractwith immediate | 
[D-=((ong), (shon)}N-rD | nave_| _ SUBTRAGT | Left subtractwith immediate =| 
[r= (long) hor} &N—[ nz00_| ADD | Logical AND withimmediate | 
[= (ong). (shon)}rD[N | nzoo| OR | Logical OR with immediate ‘| 
[D={(long)(shon)D"N | nz00_| EXCLUSIVE-OR | Logical XOR with immediate =| 
r= (long) (shor) DEN [-nz0e[ADD-CARRY REV| Garry reverse add with immediate | 
[D=((long).(shony}rO<<N | n200_| SHIFT LEFT [Leh shittbyN(OSNSST) 
[Weng) GhoniyD | nave_| COMPARE | Comparewihimmedate i 
(ong). chon} rD-&N | na00_| BIT TEST | Bit Test with immediate 

HinCOND)IrD = (Wong) (hoy S1+rS2_| neve [| ADD | Conditional Segistoradd_ | 





[if(COND)] rD = {(long), (short)} rS1 - rS2 nzve SUBTRACT Conditional 3-register subtract 

[if(COND)} rD = {(long), (short)} rS1 & rS2 nz00 ADD Conditional 3-register AND 

[if(COND)} rD = {(long), (short)} rS1 &~ rS2 AND- Conditional 3-register AND with complement 
COMPLEMENT 


{A(COND)] rD = {(long), (short) St |rS2_[_+nz00_| 
[if(COND)] rD = {(long), (short)} rS1*rS2__ | _nzoo_| EXCLUSIVE OR 
[if(COND)] rD = {(long), (short)} rS1#rS2__ | _nz0c_| ADD-CARRY REV 
[if(COND)] 1D = {(long), (short)} rS1 << rS2 ‘SHIFT LEFT 
if(COND)] rD = {(long), (short)} rS1 >> rS2 SHIFT RIGHT 


ny 


[_nz00 | 
[_nz00_| 
[if(COND)] rD = {(long), (shor)} rS1 $>>rS2 SHIFT RIGHT- | Conditional arithmetic right shift by rS2 
ARITH (0<rS2<31) 


{f(COND)] {(long), (short)} rS1 = nave COMPARE: 
[f(COND)] {(long), (short)} rS1 & rS2 BIT TEST 


[_nz00_| 

ba afte ete pr ate ero 
[i(COND)] rD = {(long), (short)} rS>>1 7z00_| SHIFT RIGHT _| Conditional logical right shift by 1 

[if(COND)] 1D = {(long), (short)} rS<<1 71200 SHIFT LEFT _ | Conditional left shift by 1 

{if(COND)] D = {(long), (short)} rS*2 nave ADD 
[if(COND)] rD = {(long), (short)} rS>>>1 ROTATE RIGHT 
[if(COND)] rD = {(long), (short)} rS<<<1 [| nzve | ROTATE LEFT 
{if(COND)] FD = {(long), (short)} -rS SUBTRACT 
[if(COND)] rD = {(long), (short)} rS [nave _| 
[if(COND)] rD1 = {(long), (short)} rS {+, -}1 INCR/DECR 


[if(COND)] sp = (long) sp {++, —} INGRIDECR | Conditional increment/ decrement of stack 
















pointer by 4 (Interrupts are disabled for one 
instruction cycle) 
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4.3.3 CA Instructions - Data Move 


The CA data move instructions are divided into four types: (1) immediate data sets of CAU registers, (2) data 
transfers between memory and CAU registers, (3) data transfers between IO registers and CAU registers, and (4) 
data transfers between IO registers and memory. The data can be moved as 8-bit (byte), 16-bit (half-word), or 
32-bit (word). Table 4-9 lists the CA Data Move instructions and Table 4-10 shows the operand replacements. 


When loading CAU registers from memory or an IO register, the size of the transfer may be specified to be 8-bit 
(byte, hbyte, char), 16-bit (short, ushort), or 32-bit (long). Long is the default and does not have to be 
specified. The affect on CAU register bits for each operation is described below: 


Bits Affected 


Type 23—16 15-8 7—0 Description 
Specified 
0 


Zero-extend to upper 24-bits 


(byte) 0 
(hbyte) 0 Zero-fill bits 7-0, zero-extend upper 16-bits 
(char) Ss Sign-extend bit 7 to upper 24-bits 
(short) Ss Sign-extend bit 15 to upper 16-bits 
(ushort) 0 0 Zero-extend to upper 16-bits 
(long) DATA DATA Load bits 3i—0. 


When storing CAU registers in memory or IO registers, the size of the operation may be specified to be 8-bit 
(byte, hbyte), 16-bit (short), or 32-bit (long). Long is the default and does not have to be specified. When the 
destination is an IO register, only the number of IO register bits specified by the size of the transfer are affected 
(8, 16, 32). The size specified for the transfer must be the same as the size of the register implemented in the 
DSP3210. The difference between (byte) and (hbyte) is the CAU register bits selected for the operation. The 





CAU register bits selected for each operation are described below: 






Source CAU Register Bits Selected 


Type 23-16 | 15-8 70 Description 
Specified 


(byte) x x Store bits 7—0 

(hbyte) Store bits 15—8 
(short) Store bits 15—O 
(long) Store bits 31—0 


When loading IO registers from memory, the size of the transfer may be specified to be 8-bit (byte), 16-bit (short), 
or 32-bit (long). Only the number of IO register bits specified by the size of the transfer are affected (8, 16, 32). 
When storing IO register to memory, the size of the transfer may be specified to be 8-bit (byte), 16-bit (short), or 
32-bit (long). The size specified for the transfer must be the same as the size of the register implemented in the 
DSP3210. 
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Description 
Ref 


rD= (short) N nz00 SET Reg Set- 16-bit sign extended to 32-bits 
rD= (ushort24) M =) SET24 Reg Set- 24-bit zero extended to 32-bits 
MEM, *L} =((byte), (hbyte), (short), (long) rS | — | STORE | Reg Store- MEM, *L are 8-, 16-, or 32-bit 

rD = {(char), (byte), (hbyte)} {MEM, *L} nzoo LOAD Reg Load- MEM, *L are 8-bits 

1D = {(short), (ushort)} (MEM, *L} nzoo LOAD Reg Load- MEM, *L are 16-bits 


Table 4-9 CA Data Move Group Instructions 





1D = (long) {MEM, “L} nz00 LOAD Reg Load- MEM, *L are 32-bits 
ba a a br 
rD = {(char), (byte), (hbyte)} iorS 

rD = {(short), (ushort)} iorS 


rD = (long) iorS 





























[alee Meaning 
10-r22, pc, pesh ‘One of 22 CAU registers, pc, or pc shadow register 


One of 22 CAU registers 


8-, 16-, or 32-bit memory location 
rP is amemory pointer (rP=r0—r22); when using the sp (r21), 

the size of the transfer must be long (++ and -- are +4 and —4), 
rlis an increment register (rl=r0-r22) 


One of the IO registers. The size of the selected register and 
transfer size must be the same. (#=0, 8, 10, 12, 14, 15) 


[io [| tor | One of the 1Oregisters. (#=0,8,10,12,14.15) 
[N | _‘t6bitnumber [2's complementnumber 
[MT eabitnumber[Unsignednumber 








tP, *rP++, *rP--, *rP++rl 












ior# 
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4.4 DSP3210 Programming 


This chapter explains how to combine the instructions presented in Chapter 3 to form valid DSP3210 programs. 
The programmer must be aware of the latencies that occur because of the pipelining of the device and the 
restrictions on the order of certain instructions. This chapter presents this information, along with a set of 
programming examples for encoding common functions. 


4.4.1 Restrictions 


This section describes three sequences of instructions that are not allowed. The assembler recognizes these 
sequences and generates a fatal error. 


4.4.1.1 Restriction 1 - Z-Field Pointer 


The instruction immediately following a DA instruction with a Z-field write, in which the Z-field pointer is post- 
modified and cannot reference the Z-field pointer except as another Z-field pointer of a DA instruction. This is a 
fatal error, and the assembler produces an error message. In the event that an instruction with a post-modified Z- 
field pointer follows a branch instruction, the assembler produces a warning message. 


Example 1: 


“r++ =a2=a2+ ‘al 
m=art+12 


ERROR: Using reg from preceding inst's Z-field pointer 
Example 2: 


*ri-- =a2=a2+ "al . 
a2=a2+*ri*al 


ERROR: Using reg from preceding inst's Z-field pointer 
Example 3: 


if (le) goto r20 
*r1=a2=a2+a3*al 


Warning: Use output pointer reg only as output pointer in DAU inst atdest after branch instruction. 
4.4.1.2 Restriction 2 - CAU and IO Register Store 


ACAU or IO register store instruction cannot immediately follow a DA instruction (multiply/accumulate or special 
function) that references memory in the Y field. This is a fatal error, and the assembler produces an error 
message. In the event that a DA instruction with a memory reference in the Y field immediately follows a branch 
instruction, the assembler produces a warning message. The affected CAU and IO Register Store instructions 
are: 

{MEM, *L} ={(byte), (hbyte), (short), (long)} rS 

iorD = {(byte), (hbyte), (short), [(long)]} rS 

MEM = {(byte), (short), (long)} iorS 


Recall that one form of the DA instructions is: 
[2] = aN = [aM {+,-} YX 
Example 1: 


a2=a2+*r2*a3 
"4=15 


ERROR: CAU reg or IO reg store can't follow DAU inst with 'Y' mem ref 
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a0 = ic(ibuf) 
‘4=15 


ERROR: CAU reg or IO reg store can't follow DA inst with "Y' IO register ref 
Example 3: 


if (ge) goto r3 
a2=a2+2* a3 


Warning: branch dest can't start with CAU reg store inst. 
4.4.1.3 Restriction 3 - CAU and IO Register Load 


A CAU register (r0—r22) that is being loaded from memory or from an IO register, or an IO register (ps, emr, pew, 
dauc) that is being loaded from memory cannot be referenced in the following instruction. The only exception is 
that a CAU register can be referenced as the Z-field pointer of a DA instruction. Note that the flags that result 
from this load cannot be referenced in the following instruction either. This is a fatal error, and the assembler 
produces an error message. If a CAU or IO register load follows a branch instruction, the assembler produces a 
warning message. A dauc load from memory must be followed by a CA instruction. The affected CAU and IO 
Register Load instructions are: 


1D = {(char), (byte), (byte), (short), (ushort), (long)} {MEM, *L} 
iorD = {(byte), (short), (long)} MEM 


Example 1: 


3=*r2 sg 
4=13+17 


ERROR:Cannot reference reg loaded in previous inst except as Z-field ptr 
Example 2: 


if (ne) goto r3 +8 
14 ="17++ 


Warning: branch destination can't start with inst referencing reg being loaded. 
4.4.2 Latency Effects 


The pipelined execution of Data Arithmetic DSP3210 instructions results in four latency effects that are listed in 
this section. See 8.2 Data Arithmetic Unit (DAU) for an explanation of the DAU pipeline. 
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4.4.2.1 Latency 1 - DA Instruction - Memory Writes 

When a DA instruction specifies a write to memory, the value written is not available to be read from that location 
until four instructions later (three-instruction latency). 

Example: 


1 *r3=a0=a0 
12 ‘3 =a3=a3 
(ye 

4. 
15 al="3 


The value read from memory in instruction five (15) is the value written in 11, not 12. 
4.4.2.2 Latency 2 - Accumulator as Multiplier Input 


When an accumulator, a0-a3, is used as an input to the multiplier, its value is established no sooner than three 
instructions prior to the multiply instruction (2-instruction latency). Note that this also applies to an 
accumulator using the X field of an instruction of the form: 


[Z =] aN = -Y {+,-} X 
Example: 


1 aQ=a0+*r1 **r2 
12 aQ=a0+al 


3 . 
14 a2=a0* a0 
15 al=a2+a0 . 


The value of a0 used in 14 is calculated in 11. The value of a0 used in I5 is calculated in 12. Note that the value of 
a2 used in I5 is calculated in 14 since there is no latency effect on accumulators used as inputs to the adder. 


4.4.2.3 Latency 3 - Branching 


When a CA Control Group instruction of the form if()goto, call, return, goto is executed, the instruction 
immediately following is also executed before the branch occurs. This is commonly referred to as a delayed 
branch. The ireturn instruction is different, and execution of the base-level program resumes in the following 
instruction cycle. 


Example 1: 


11 if (eq) goto over 
2 ned 
13 
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The instruction 2 is executed even if the condition is true and the branch is taken. If this is undesirable, a nop 
can be placed after the branch instruction, or if possible, the instructions can be rearranged. Because of this 
latency, a complex situation arises if successive branch instructions are coded as in the following example. 


Example 2: 


11 gotoA 
12 gotoB 


AL 


B: 
Cu 
The order of execution is 11, 12, A, B. If the instruction at A is not a goto, execution continues 


from B. If the instruction at A is goto C, the order of execution is 11, 12, A, B, C, 
and execution continues from C. Successive branch instructions are useful in some applications. 


4.4.2.4 Latency 4 - Conditional Branching on DAU Conditions 


A DAU condition tested by a conditional branch or conditional arithmetic/logic instruction is established by the last 
DA instruction that affects DAU flags no sooner than four instructions prior to the test (3-instruction latency). 
Example: 

11 aQ=a0+al . 

12 a2=a0*a2 

eee 

14. 

15 if (agt) goto next 
15 /* latent instruction */ 


The condition tested in I5 is established by the instruction at 11, not 12. 

Because of this latency effect, use the zero-latency ifalt, ifagt, and ifaeq functions where possible (see 4.2.2 DA 
Instructions - Special Functions). The DA condition tested by these conditional accumulator loads is established 
by the last DA instruction that affected the DAU flags. 

4.4.3 Common Programming Techniques 

The following are programming techniques common to the DSP3210. 

4.4.3.1 Subroutine Nesting 

The following example is intended only as an illustration of DSP3210 programming technique. The convention 
adopted is that the stack frame grows from high addresses toward low addresses, and that the stack pointer 


contains the first empty location. The latter uncommon convention saves one instruction of overhead in the push 
operation because the DSP3210 has only postincrement and not preincrement addressing. 
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The following example shows how to nest subroutines: 


call subr1 (r1) /* ret1 is stored in r1 */ 
2 /* latent instruction */ 


rett: 

subrt: . 
“*sp-- =n /* push ret! onto stack using stack pointer*/ 
call subr2 (r1) /* ret2 is stored in r1 */ 
f /* latent */ 

ret2: 
return (1) /* go to rett */ 
S M latent */ 

subr2: 
sp = Sp++ /* sp now points to top of stack = */ 
m=*sp /* restore linkage pointer (ret1) */ 
return (r1) /* go to ret2 */ 
¢ /* latent */ 


Alternatively, if only a small level of nesting is required and sufficient CAU registers are available, the different 


return addresses can simply be saved in different CAU registers. 


4.4.3.2 Passing Parameters 


Parameters may be passed to a subroutine if they immediately follow the latent instruction associated with the 


call: 


call subr (11); 

3 /* latent instruction */ 
parameter #1 for subr 

parameter #2 for subr 

parameter #3 for subr 

parameter #4 for subr 


QNOnPON> 


tc. 


The subroutine should use *r1++ to retrieve the parameters. After all four parameters are retrieved, a return (r1) 


is executed that resumes execution at line 7. 
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4.4.3.3 Quick Interrupt Facility 


Each entry in the interrupt vector table may contain two instructions. In general, these may be used for a goto 
and the delayed branch latent instruction. Alternatively, by using these two locations for an instruction followed by 
an ireturn, an interrupt routine with an overhead of three instruction cycles can be accomplished. 


The interrupt operation performed by the DSP3210 while processing the instruction stream 11, 12, and I3 is as 
follows: 


" /* program instruction #1 a 

12 /* program instruction #2 ie 

goto evip + offset /* overhead branching to interrupt */ 

nop /* vector table plus the offset is apes by the DSP3210 basedon = */ 
/* the interrupt source 

IR1 /* useful interrupt routine fenoien tif 

ireturn /* no latent instruction executed after ireturn sf 

13 /* program instruction #3 sf) 


IR1 is the useful interrupt routine instruction. IR1 cannot be a DA instruction. 
4.5 Common Signal Processing Functions 


This section presents instruction sequences for common functions performed in digital signal processing. Many 
more example programs are contained in the DSP3210 Application Software Library Reference Manual. All 
instructions require one 32-bit word for storage and execute in one instruction cycle. Note that the time to 
execute one instruction is affected by conflict and external waitstates. 


4.5.1 Standard Signal Processing Functions 


The following functions illustrate the source code syntax and programming techniques for the DSP3210. All 
instruction sequences may be preceded by the do instruction for repeated execution. 


(a) Convolution (FIR filter) 
k=N-1 
y(n) = © hfk] * x(n k) 
k=0 


(1) al sat + (B44 = "1444) * 124+ 


(b) LMS update for adaptive filtering 

h(k+1) = h(k) + 2 * p* e(k) * x(k) 
(1) r++ = at =*r2+a0* "44+ 
(c) Second-order biquad IIR filter 


wi(n) = vi-1(N) - d(i, 1)s i4 (n-1) - d(1, 2)s i2 (n-1) 
v i(n) = n(i, O)w j(n) + n(i, 1)s j4(N-1) + n(i, 2)s ja(n-1) 


Si1(n) = w (n) 

$ ia(n) = s j4(n-1) 
(1) a0 =* 15 * *r2++ /* input =" K1 af 
(2) aQ=a- 1344 * 124+ I d(i, 1) term vi 
(3) *r44++ = a0 =a0 - *13--* “r++ /* d(i, 2) term */ 
(4) a0 = a0 + (*r4++ = *13++) * "r++ /* n(i, 1) term */ 
(5) *r6 = a0 = a0 + “134+ * *12++ /* n(i, 2) term % 
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(d) FFT Butterfly 
(1) a0 = *r14++117 + a2 * “1344117 fT=R(i)*Ur* R(k) */ 
(2) *rit+4r17 = a0 =a0-*+4r19%a3_ Ri) = T= T- Ui“ I(k) */ 
(3) al = “r1+4r19 + a3 * "14417 PT=\i)+Ui*R(k) ‘7 
(4) "11244117 = a0 =-a0+ “4417 * "16 /* (RK) =-T+2°R(i) */ 
(5) “teat = al =al+"tert7 "a2 /* Mi) =T=T+ Ur" M(k) */ 
(6) "11244115 = al =-al+ “rt44r15 * 6 Mk) +-T+ 2°) 7 
(e) In-place bit reversal 
A: m-2 /* compare true and bit-reversed */ 
if(ge) pegoto B /* exchange if 1 <r2 ah 
men /* latent instruction bi 
"1144117 = a0 = *12+4r17 /* complex exchange oh 
“1441 
“r2+4r1 
*r2+4119 = a0 = “1544119 
B: m=an+4 
if(r18-- >= 0) pegoto A 
=r2#r16 /* # is carry reverse add ¥) 


(f) Inverse using the seed instruction 
a3 = seed(*r1) 
K 






(2) 6B 
(3) 14 = result 
(4) aO = “r++ 4.03 * 11 
(5) al = *r3++ * a3 
(6) nop 
(7) a0 = *r3++ + a0 * ad 
(8) a2Q=al * "r++ 
(9) nop 
(10) a0 =a0*al 
(11) al = *r3--r17 + a0 * a2 
(12) nop 
(13) nop 
(14) *r444+ =a0=a0-at* a0 
_K: float -1.60138661 
float 0.82371354 
float 0.85221935 
float -1.0 


4.5.2 Nonlinear Functions 


ab 
ai 
rh 


/* r1 points to input table 
/* 3 points to constants 
/* r4 points to output table 
*aQ=-1.6-x*y 
ai=0.8*x 


vii 
sh 


/* a0 =31-3.2*x*y + x*2*y*2*/ 

a2=0.8*x*y a 
fa0=2.8-2.6*x'2* y+ 
TERNS A yA2 
fal=28*x*y-2.6°x*2* 
[* y2 + 0.8 * x43 * y*3 


0:8 ook 
oy 

/ 
7 


/* result is in a0 df 


Several nonlinear functions are programmed using the ifalt or ifagt special functions (see 4.2.2 DA Instructions 
Special Functions). The DSP3210 programs for some of these nonlinear functions are included here. 


(a) Absolute Value or Full-Wave Rectification 


a0 = -a2 
a2 = ifagt(a0) 


(1) 
(2) 


The value in a2 is replaced by its absolute value. 
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Instruction Set 
(b) Signum 
(1) aQ=*rt 
(2) alt=" 


(3) *r3 = a0 = ifalt(*r2) 


The signum, or sign function, returns a +1 for positive operands and a —1 for negative operands. The above 
program assumes that the memory location pointed to by r1 contains a floating-point +1 and that the location 
pointed to by r2 contains a floating-point -1. The value in the memory location pointed to by r3 is replaced by its 
signum. (Note: duac[6]=0, therefore Z-field writes are unconditional). 


(c) Minimum 


(1) aQ=-a1 +a2 
(2) at =ifalt(a2) 


Accumulator a1 is written with the smaller of the two values stored in a1 and a2. 
(d) Maximum 


(1) a0=al-—a2 
(2) at = ifalt(a2) 


Accumulator a1 is written with the larger of the two values in a1 and a2, 
(e) Half-Wave Rectification 


(Q) 0 at="r2 
(2) al =ifalt(*r1) _/* r1 points to a floating-point zero */ 


The contents of memory location r2 are halftwave rectified and then stored in a1. 


(f) Two-Sided Limiter 





(1) /* initialize a2 with negative limit fh 
(2) /* compare a0 with positive limit bs 
(3) 7° if aO>limit, replace a0 with limit */ 
(4) /* compare a0 with negative limit th 


(5) a0 =ifalt(a2) —_/* if aO<limit, replace a0 with neg. limit */ 


The two-sided limiter restricts the magnitude of the input. In the example above, it is assumed that the limit is 
stored in a memory location pointed to by r1. The contents of a0 are limited, and the result is stored back into a0. 
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4.6 Detailed Description of Instruction Set 


This section contains a detailed description of the DSP3210 instruction set presented in sections 4.2 and 4.3. The 
description consists of a short explanation of the operation performed by the instruction, the flags affected by the 
instruction, a description of the binary encoding, and an example. The instructions are organized alphabetically 
by the instruction reference name. 


Instruction Reference 


ADD 


Instruction Reference 


IFALT 4-54 











































ADD-CARRY REV INCR/DECR 

AND INT16 

AND-COMPLEMENT INT32 

BIT TEST IRETURN 
LOAD 


LOAD-IOR 





EXCLUSIVE OR 
FADD-STORE 















































FADD-TAP ROTATE LEFT 
FLOAD-STORE ROTATE RIGHT 
FLOAT16 ROUND 
FLOAT32 SEED 
FMULT-ACC-STORE SET 
FMULT-ACC-TAP SET24 
FMULT-ADD-STORE SFTRST 
FMULT-STORE SHIFT LEFT 
FMULT-TAP SHIFT-OR 
GOTO SHIFT RIGHT 






GOTO-COND 
GOTO-LOOP 


SHIFT RIGHT-ARITH 
STORE 

STORE-IOR 
SUBTRACT 

WAITI 
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ADD 


INSTRUCTION: __rD = {[(long)], (short)} rS3 +N (1) 
[if(COND)] rD = {[(long)], (Short)} rS1 + rS2 (2) 
[if(COND)] rD = {[(long)], (short)} rS * 2 (3) 
[if(COND)] rD = {[(long)], (short)} rS (4) 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nzve 
DESCRIPTION 


These instructions perform integer add operations. The items in brackets [] are optional. A 32-bit operation, 
(long), is the default size and does not have to be specified. (short) indicates that a 16-bit operation is desired. A 
(short) operation sign extends the result from bit 15 to bit 31. Flags are set according to the 16- or 32-bit 
operation flag rules (see Table 4-1), as specified by the instruction. If the instruction is conditionally executed and 
the COND is true, the result is placed in the register rD, and the flags are set. If the instruction is conditionally 
executed and the COND is false, neither the register rD nor the flags are affected. 


(1) is a three-operand add instruction with an immediate operand. The contents of rS are added to a value N, and 
the result is placed in rD. N is a 16-bit 2's complement number that is MSB-extended to 32 bits. This instruction 
is useful in computing addresses by the base plus displacement technique. 


(2) is a 3-register add instruction. The contents of register rS1 and register rS2 are added, and the result is 
placed in register rD. 


(3) is a multiply by 2 instruction. This instruction multiplies the contents of register rS by two, and the result is 
placed in register rD. This instruction is performed by adding rS to itself (rS+rS). This instruction should be used 
for a left shift by 1 when setting of the carry flag is desired. 


(4) is an assignment instruction. This instruction assigns the contents of register rS to the register rD. The 
operation actually performed is rD = rS+r0. 


[1 | 022] (hare-wired zero), or one of 22 CAU registers __——— 

ro—122 

| _N | 16-bitnumber_[2scomplementinteger 
1S3 


r0—r22, pc, pesh | r0 (hard-wired zero), or one of 22 CAU registers, the program 
counter, or the program counter shadow register 
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ADD 


INSTRUCTION (cont.): —_rD = {[(long)], (short)} rS3 + N 


4-22 


[if(COND)] rD = {{(long)], (short)} rS1 + rS2 
[if(COND)] rD = {[(long)], (short)} rS * 2 
{if(COND)] rD = {[(long)}, (short)} rS 







Replacement Table for CA Instructions 
(COND) - CAU Conditions 


| Condition | Flagt | Meaning 
Condition is always false. 
true Not applicable _| Condition is always true. 

Result is nonnegative (plus) 
recy zen 3 n=1 Result is negative (minus) 
Result not equal to zero 
Result equal to zero 


ve Overflow clear, no overflow 
vs Overflow set, overflowed 





Carry clear, no carry 

Carry set, carry 

Greater than or equal to zero 
Less than zero 

Greater than zero 

Less than or equal to zero 
Greater than (unsigned number) 
Less than (unsigned number) 


=@ 
=o 
ais 
pas 
<< 
wot 
ao 


z|(n‘v)=0 
z|(n4v)=1 














Replacement Table for CA Instructions 
(COND) - DAU Conditions 
Meaning 





Not equal to zero 

Equal to zero 

Greater than or equal to zero 
Less than zero 

Overflow clear, no overflow 
Overflow set, overflowed 
Underflow clear, no underflow 





Underflow set, underflowed 
Greater than zero 
Less than or equal to zero 


ADD 


INSTRUCTION (cont.): —_rD = {{(long)], (short)} rS3 + N 
[if(COND)] rD = {{(long)], (short)} rS1 + rS2 
[if(COND)] rD = {{(long)], (short)} rS * 2 
[if(COND)] rD = {[(long)], (short)} rS 


Replacement Table for CA Instructions 
Jer Gonatton | ees rag ite | eee eee Meaiigs eer ea 
ibe ibf=0 Input buffer empty 
ce onl ibf=1 Input buffer full 
ou 
obi Output buffer full 
syc S| Sync signal low 
Serial frame boundary clear 
IRON pin is logic one (1). 
IRON pin is logic zero (0). 


IR1N pin is logic one (1). 
IRAN pin is logic zero (0). 














LATENCY 


A DAU condition tested by a conditional branch or conditional arithmetic/logic instruction is established by the last 
DA instruction that affects DAU flags no sooner than four instructions prior to the test. For details on latency 
effects refer to Secion 4.4.2. 


INSTRUCTION FORMAT (1) (Format Sa—b) 


Bit 31__| 30-26 25—21 20—16 15—0 
Field (Lee) (ean ae (Si a (a | a 


INSTRUCTION FORMAT (2-4) (Format 6a—b) 


Bit 31 30—25 24—21 20—16 15—11 10—5 4-0 
Field El BUN ee ES Se ea el 
E Field: Specifies the size (16/32) of the ALU operation. 
1D Field: Specifies the destination register. 
rS1, rS2, rS3 Fields: Specifies the source register. 
N Field: Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 
F Field: Specifies the ALU function to be performed (+). 
C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 


r11= (short) r1 +32 





Bit 31__| 30—26 25—21 20—16 15—0 
Encoding [0 | 00101] 01011 | 00001 | 0000000000100000 
Meaning [{shont) [tt Tt 0x0020 
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ADD-CARRY REV 


INSTRUCTION: _rD = {[(long)], (short)} rD #N (1) 
[if(COND)] rD = {[(long)], (short)} rS1 #rS2 (2) 


CLASS: CA-Arithmetic/Logic 

CAU FLAGS AFFECTED: nzc, v=0 

DESCRIPTION 

These instructions perform carry reverse ADD operations, i.e., an ADD operation with the carry propagating from 
MSB to LSB. The items in brackets [ ] are optional. A 32-bit operation, (long), is the default size and does not 
have to be specified. (short) indicates that a 16-bit operation is desired. The (short) operation propagates the 


carry from bit 15 to bit 0 and sign extends the result from bit 15 to bit 31. Flags are set according to the 16- or 32- 
bit operation flag rules (see Table 4-1), as specified by the instruction. 





(1) This instruction performs a carry reverse add operation on the contents of register rD with the value N and the 
result is placed in the register rD. N is a 16-bit 2's complement number MSB-extended to 32-bits. 


(2) This instruction performs a carry reverse add operation on the contents of register rS1 with the contents of 
register rS2, and the result is placed in the register rD. If the instruction is conditionally executed and the COND 
is true, the result of the operation on rS1 and rS2 is placed in the register rD, and the flags are set. If the 
instruction is conditionally executed and the COND is false, neither the register rD nor the flags are affected. The 
COND field is the same as for the add operations (See page 4-21 ADD). When rS1 = rS2 (rS), this instruction 
performs a right shift by one binary place and the LSB of rS is placed in the c flag. 


Instruction Replacement Table 
Replacement 
[| —_r0—r22 _| r0 (hardwired zero), or one of 22 CAU registers 
1S1, rS2 10 (hard-wired zero), or one of 22 CAU registers 
[| N [| __16-bitnumber [2's complement integer 


INSTRUCTION FORMAT (1) (Format 6c—d) 


Bit 31 30—25 24—21 | 20-16 15—0 
Gs Se el ESO a a An a a ae) 


INSTRUCTION FORMAT (2) (Format 6a—b) 











Bit 31 30—25 24—21 20—16 15—11 10—5 4—0 
Field BES ROE RR ie ed BS a] 
E Field: Specifies the size (16/32) of the ALU operation. 
1D Field: Specifies the destination register. 


rS1,rS2 Fields: Specifies the source register. 

Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 
Specifies the ALU function to be performed (#). 

Specifies the condition being tested. 





EXAMPLE (For field encodings refer to Chapter 10.) 





2= 14# 14 

Bit 31 30—25 24—21 | 20-16 | 15-11 10—5 4-0 

Encoding [_1__ [| 001100 | 0011 [00010 [01110 [000001| 00100 
Meaning | (long) fo or 2 i re | 
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AND 


INSTRUCTION: __rD = {[(long)], (short)} rD & N (1) 
[if(COND)] rD = {[(long)], (Short)} rS1 & rS2 (2) 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nz, v=c=0 
DESCRIPTION 


These instructions perform bit-by-bit, logical AND operations. The items in brackets [ ] are optional. A 32-bit 
operation, (long), is the default size and does not have to be specified. (short) indicates that a 16-bit operation is 
desired. Flags are set according to the 16- or 32-bit operation flag rules (see Table 4-1), as specified by the 
instruction. 


(1) This instruction performs a bit-by-bit, logical AND operation on the contents of register rD with the value N and 
the result is placed in the register rD. N is a 16-bit 2's complement number that is MSB-extended to 32 bits. 


(2) This instruction performs a bit-by-bit, logical AND operation on the contents of register rS1 with the contents of 
register rS2, and the result is placed in the register rD. If the instruction is conditionally executed and the COND 
is true, the result of the operation on rS1 and rS2 is placed in the register rD, and the flags are set. If the 
instruction is conditionally executed and the COND is false, neither the register rD nor the flags are affected. The 
COND field is the same as for the add operations (See page 4-21 ADD). 


[Term [Replacement | SSCMeaning i 
[_N[teitnumber [2s complementinteger 


INSTRUCTION FORMAT (1) (Format 6c—d) 


Bit 31 30—25 24-21 | 20-16 15—0 
Field Se a ee ee ee eee 


INSTRUCTION FORMAT (2) (Format 6a—b) 




















Bit 31 30—25 24—21 20—16 f= 10—5 4—0 
Field Be Rn en es Ee ee 
E Field: Specifies the size (16/32) of the ALU operation. 
1D Field: Specifies the destination register. 
1S1, rS2 Fields: Specifies the source register. 
N Field: Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 
F Field: Specifies the ALU function to be performed (&). 
C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 


if(cc) r1 = (short) r1 & r2 








Bit 31 30—25 24—21 | 20-16 | 15—11 10—5 4-0 
Encoding [0 | 001100 | 1110 | 00001 | 00001 [001000] 00010 
Meaning [eee | TY Cs as 
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AND-COMPLEMENT 


INSTRUCTION: _[if (COND)] rD = {[(long)], (short)} rS1 &~ rS2 
CLASS: CA-Arithmetic/Logic 

CAU FLAGS AFFECTED: nz, v=c=0 

DESCRIPTION 


This instruction performs a bit-by-bit, logical AND operation on the contents of register rS1 with the complement of 
the contents of register rS2. The result is placed in the register rD. The items in the brackets [ ] are optional. A 
32-bit operation, (long), is the default size and does not have to be specified. (short) indicates that a 1 6-bit 
operation is desired. For the (short) operation, the result is sign extended from bit 15 to bit 31. Flags are set 
according to the 16- or 32-bit operation flag rules (see Table 4-1), as specified by the instruction. If the instruction 
is conditionally executed and the COND is true, the result is placed in the register rD, and the flags are set. If the 
instruction is conditionally executed and the COND is false, neither the register rD nor the flags are affected. The 
COND field is the same as for the add operations (See page 4-21 ADD). 


[| _=122 [10 (hard-wired zero), or one of 22 CAU registers | 
[_N | _ 16-bit number 


INSTRUCTION FORMAT (Format 6a—b) 







Bit 31 30—25 24—21 20—16 15—11 10—5 4—0 
Field SS SOG SS nes Se Ses eee 
E Field: Specifies the size (16/32) of the ALU operation. 
rD Field: Specifies the destination register. 
rS1,rS2 Fields: Specifies the source register. 
F Field: Specifies the ALU function to be performed (&~). 
C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 


15 = (short) r4 &~ 3 
Bit 31 30—25 24—21 20—16 15—11 10—5 4—0 
0 


Encoding [0 | 001100 | 0110 | 001707 | 00100 [000001] 00011] 
Meaning [sho] SS) EST (trey) P| 
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BIT TEST 


INSTRUCTION: —{{(long)], (short)} rD & N (1) 
[if(COND)}] {[(long)], (short)} rS1 & rS2 (2) 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nz, v=c=0 
DESCRIPTION 


These instructions perform bit-by-bit, logical AND operation, but the results are not stored. These instructions can 
be used for bit tests. The items in brackets [] are optional. A 32-bit operation, (long), is the default size and does 
not have to be specified. (short) indicates that a 16-bit operation is desired. Flags are set according to the 16- or 
32-bit operation flag rules (see Table 4-1), as specified by the instruction. 


(1) This instruction performs a bit-by-bit, logical AND operation on the contents of register rD with the value N. N 
is a 16-bit 2's complement number that is MSB-extended to 32 bits. 


(2) This instruction performs a bit-by-bit, logical AND operation on the contents of register rS1 with the contents of 
register rS2. If the instruction is conditionally executed and the COND is true, the flags are set. If the instruction 
is conditionally executed and the COND is false, the flags are not affected. The COND field is the same as for the 
add operations (see page 4-21 ADD). 


Replacement Meaning 
bp] o—22 | ardwred zero), orone of 22 CAU registers | 
[__N | __16-bitnumber [2's complement integer 


INSTRUCTION FORMAT (1) (Format 6c—d) 


Bit 31 30—25 24-21 | 20—16 15—0 
el ee) eae] CTS Da a Sal a | 


INSTRUCTION FORMAT (2) (Format 6a—b) 


Bit 31 30—25 24—21 | 20-16 | 15—11 10—5 4-0 
Field E 001100 F 1D rS1 Cc 



























E Field: Specifies the size (16/32) of the ALU operation. 

rD Field: Specifies the destination register. 

rS1,rS2 Fields: Specifies the source register. 

N Field: Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 
F Field: Specifies the ALU function to be performed (& (bit test)). 

C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 






















15 & 13 

Bit 31 30—25 24—21 | 20-16 | 15—11 

Encoding 001100 ee 00101 | 000001 
Meaning [_(long) _| & (bit test 5 true 
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CALL 


INSTRUCTION: call {N, M, rB, rB+N} (rM) 
CLASS: CA-Control 
CAU FLAGS AFFECTED: None 


DESCRIPTION 


This instruction transfers control of the program counter (pc) to a location specified in the braces {}. When this 
instruction is executed, the value of the program counter (pc) is stored in the specified rM register, and the value 
in the braces {} is moved to the pc. Register rM contains the address of the instruction that follows the latent 
instruction. The values in the braces {} specify an address N (16-bit MSB extended to 32-bits), an address M (24- 
bit zero extended to 32-bits), a CAU register rB, or a value in a CAU register rB plus a displacement value N (16- 
bit MSB extended to 32-bits). N is always 16 bits and is sign-extended to 32 bits for all operations. M is 24-bits 
and is zero-extended to 32-bits. When using pc as rB, the syxtax: 

pecall label 
may be used to perform a pc-relative subroutine call. The linker will compute the appropriate N (offset) value. 
The call M (rM) instruction is a format 8c instruction. All others are format 4a. This instruction disables 
interrupts for one instruction cycle. 


Instruction Replacement Table 
Replacement 


B pe, ro—r22 The program counter, r0 (hard-wired zero), or one of 
22 CAU registers 








‘One of 22 CAU registers 
16-bit number _| 2's complement integer 
24-bit number _| Unsigned integer 


LATENCY 


When the above instruction is executed, the instruction immediately following is also executed before the branch 
occurs. For details on latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 4a) 


Bit 31—26 25—21 20—16 15—0 
Field cooioo | mi [ 68 | 


tM Field: Specifies the register used to store the return address. 

rB Field: Specifies the base address register. When N specifies 0, rB is the 
destination address. 

N Field: Specifies a 16-bit 2's complement integer used as an offset value for the 
base address. When rB specifies r0, the N field is an absolute address. 


INSTRUCTION FORMAT (Format 8c) 
Bit 31-29 28-21 20—16 15—0 
Field iS ae ee ee | 


™ Field: Specifies the register used to store the return address. 

NE Field: Specifies the most significant 8-bits of the 24-bit integer. NE concatonated 
with N forms the 24-bit unsigned integer (NE || N). 

N Field: Specifies the least significant 16-bits of the 24-bit unsigned integer. 
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CALL 


INSTRUCTION (cont.): call {N, M, rB, rB+N} (rM) 
EXAMPLE (For field encodings refer to Chapter 10.) 


call pc+0x0111 (r20) 


Bit 31—26 25-21 20—16 15—0 
Encoding 000100 711000 | 071111 [0000000100010001 
a a a a a hh 
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COMPARE 


INSTRUCTION: __{{(long)],(short)} rD - N (1) 
[if(COND)] {{(long)],(short)} rS1-rS2 (2) 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nzvc 
DESCRIPTION 


These instructions perform subtract operations, but the results are not stored. These instructions are used in 
compare operations. The items in brackets [] are optional. A 32-bit operation, (long), is the default size and does 
not have to be specified. (short) indicates that a 16-bit operation is desired. A (short) operation sign extends the 
result from bit 15 to bit 31. Flags are set according to the 16- or 32-bit operation flag rules (see Table 4-1), as 
specified by the instruction. 


(1) This instruction subtracts the value N from the contents of register rD. N is a 16-bit, 2's complement number 
that is MSB-extended to 32 bits. 


(2) This instruction subtracts the contents of register rS2 from register rS1. If the instruction is conditionally 
executed and the COND is true, the flags are set. If the instruction is conditionally executed and the COND is 
false, the flags are not affected. The COND field is the same as for the add operations (see page 4-21 ADD). 


Instruction Replacement Table 
Term Replacement Meaning 


[1 | —_r0—r22__—_—[ r0 (hard-wired zero), or one of 22 CAU registers 
r0—r22 TO (hard-wired zero), or one of 22 CAU registers 
| N | 16-bitnumber | 2's complement integer 


INSTRUCTION FORMAT (1) (Format 6c—d) 


Bit 31 30—25 24—21 | 20-16 15—0 
fh | Sal Sah Se Ee a ay 


INSTRUCTION FORMAT (2) (Format 6a—b) 


























Bit 31 30—25 24—21 20—16 15—11 10—5 4—0 
Field [ _E [| 001100 (=o en Sr C 1S2 
E Field: Specifies the size (16/32) of the ALU operation. 
rD Field: Specifies the destination register. 
1S1,rS2 Fields: Specifies the source register. 
N Field: Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 
F Field: Specifies the ALU function to be performed (- (compare)). 
C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 
















(long) r1 - 15 

Bit 31 30—25 24—21 20—16 15—0 

Encoding 001101 0111 0000000000001111 
Meaning [_(long) | compare fal 15 
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DO 


INSTRUCTION: — do K,{L,rM} (1) 
dolock K,{L,rM} (2) 
doblock {L,rM} (3) 


CLASS: CA-Control 
CAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction performs the next K + 1 instructions L + 1(or rM + 1) times. The argument K is an integer from 0 
to 127. The second argument can either be a register rM, or a value from 0 to 2047. If a register rM is specified, 
only the lower 11 bits of the register are used to specify the loop count. When K = 0, the instruction to be 
repeated is cached on-chip for the duration of the do loop. This eliminates the repeated fetch of the same 
instruction. 


(1) The do instruction initiates a do loop sequence. DMA cycles and interrupts are supported during do loops. 
For interrupts, the iteration and instruction count registers are shadowed. 


(2) The dolock instruction is used to assert an external control signal, LOCKN, for the duration of the do loop. 
Interrupts and DMA cycles are disabled during a dolock sequence. 


(3) The doblock instruction is used to decompose a block move into quad-word transfers. At the beginning of 
each quad-word, the BLMN signal is asserted and the MSO—MS3 pins indicate a quad-word transfer size (Note 
that BLMN and MSO—MS3 are configurable, see Section 6.1 - Bus Interface Signal Descriptions). The only 
instruction allowed following the doblock instruction is a0=(Z=Y)*a0, therefore K = 0 is implicit. At the completion 
of the do block sequence, the DAU flags are undefined. Interrupts and DMAs are disabled during each quad- 
word transfer. See 6.6.3 Quad-word Block Move Indicators. 


Instruction Replacement Table 


[term | Replacement [Meaning 
[ee Keo 0—127 immediate value from 0 to 127 
(if 







O—2047 immediate value from 0 to 2047 
One of 22 CAU registers 


LATENCY 


The do, dolock, and doblock instructions have no latency properties. Instructions encompassed by a do loop 
execute with the same latencies and restrictions therefore interactions between the last instruction of the loop and 
the first instruction of the loop should be carefully inspected. For details on latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 3b) 
Bit 31-25 __| 24 | 23 22—18 711 10—0 

Field TOOCETIO NE iL a) ee, ea reer Ga 
INSTRUCTION FORMAT (Format 3c) 


Bit 31-25 __| 24 | 23 22—18 17—11_|_ 10-5 
Field TS a (Ses LES 


B,M Fields: B specifies the dolock instruction, and M specifies the doblock instruction. 





» 
° 


K Field: Specifies the number of instructions to be repeated (K field is encoded 
with one less than the desired number). 

L Field: Specifies the iteration count (L field is encoded with one less 
than the desired count). 

1 Field: Specifies the register containing the iteration count (the register 


value is one less than the desired iteration count). 
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DO 


INSTRUCTION (cont.): do K,{L,rM} 
dolock K,{L,rM} 
doblock {L,rM} 


EXAMPLE (For field encodings refer to Chapter 10.) 


do 4, 120 
Bit 31—25 24 | 23 22—18 


17—11 10—5 


Encoding 7000111] 0{0| 00000 | 0000100 | o00000| 11000} 
miss [paste eat an 





Meaning 
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DSP 


INSTRUCTION: = [Z =] aN = dsp(Y) 
CLASS: DA-Special Function 

DAU FLAGS AFFECTED: NZVU 
DESCRIPTION 


This instruction converts the Y operand, an IEEE format 32-bit floating-point number, to a DSP3210 format 
floating-point number, and the result is placed in an accumulator (aN). If the exponent of the IEEE number is 255, 
it is treated as positive or negative infinity according to its sign. If positive, the converted DSP3210 number is the 
largest positive representable number in the DSP3210 format, namely Ox7fffffff. If negative, the converted 
DSP3210 number is the most negative number representable in the DSP3210 format, namely Ox800000ff. If the 
exponent of the IEEE number is zero, the converted DSP3210 number is 0x00000000, or positive zero. The 
DSP3210 format does not recognize a negative zero. The denormalized IEEE numbers are not supported. If the 
IEEE format input operand is NaN (not a number), the NaN error condition is set. If a Z field is specified, the 
result is also stored to a word address in memory. Note that the Y operand can not come from an 
accumulator. 






























IEEE-to-DSP C Special Cases 
[aw [own] commons 
yt ZNUV, NaN Comments 
s=0; exp=255; mant=0 | Ox7fffffff Treated as +c 
=255; mant=0 | 0x800000ff Treated as -co 





Non-zero mantissa treated as +2 
Non-zero mantissa treated as -co 
Denormalized number treated as zero 
Denormalized number treated as zero 


Ox7 fffffff 
Ox800000ff 
0x00000000 
0x00000000 










$=1; exp=255; mant +0 
s=0; exp= 0; mant #0 
s=1; exp= 0; mant +0 






32-bit memory location. ++ is +4 and -- is —4. 


rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 
32-bit memory location. ++ is +4 and -- is —4. 

rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 





LATENCY 
For latency effects refer to Section 4.4.2. 
INSTRUCTION FORMAT (Format 5 - Refer to [Z=] aN = ieee(Y)) 


EXAMPLE (For field encodings refer to Chapter 10.) 
“r++ = a1 = dsp(*r9++) 


Bit 31-27 26—23 2221 20—14 13—7 6—0 
Encoding 0000000/10011117 0001111 


a a en 
CS | as Ce a a RECT 
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EXCLUSIVE OR 


INSTRUCTION: _rD = {{(long)],(short)} rD “ N (1) 
[if(COND)} rD = {{(long),(short)]} rS1 * rS2 (2) 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nz,v=c=0 
DESCRIPTION 


These instructions perform bit-by-bit, logical XOR (exclusive OR) operations. The items in brackets [] are 
optional. A 32-bit operation, (long), is the default size and does not have to be specified. (short) indicates that a 
16-bit operation is desired. Flags are set according to the 16- or 32-bit operation flag rules (see Table 4-1), as 
specified by the instruction. 

(1) This instruction performs a bit-by-bit, logical XOR operation on the contents of register rD with the value N and 
the result is placed in the register rD. N is a 16-bit 2's complement number that is MSB-extended to 32 bits. 


(2) This instruction performs a bit-by-bit, logical XOR operation on the contents of register rS1 with the contents of 
register rS2, and the result is placed in the register rD. If the instruction is conditionally executed and the COND 
is true, the result of the operation on rS1 and rS2 is placed in the register rD, and the flags are set. If the 
instruction is conditionally executed and the COND is false, neither the register rD nor the flags are affected. The 
COND field is the same as for the add operations (see page 4-21 ADD). 


Term 
[| 0120 | PO hardwired zero), or one of22 CAUregisters | 
TnL tebitnumber [2s complementinteger 


INSTRUCTION FORMAT (Format 6c—d) 


Bit 31 30—25 24—21 | 20-16 15—0 
Cea 0 = MC | a 


INSTRUCTION FORMAT (Format 6a—b) 












Bit 31 30—25 24—21 20—16 ao 14 10—5 4—0 
Field ee QO (RT 
E Field: Specifies the size (16/32) of the ALU operation. 
rD Field: Specifies the destination register. 
rS1,rS2 Fields: Specifies the source register. 
N Field: Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 
F Field: Specifies the ALU function to be performed (’). 
C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 


if(irOc) r2 = 3% 14 


















Bit 31 30—25 24—21 | 20-16 | 15-11 10—5 
Encoding [__1 [| 001100 [1000 | 00010 | 00011 | 101100] 00100 
Bhosinng in (loragy | suena] ref | em FS |e aT ear 
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FADD-STORE 


INSTRUCTION:  [Z =] aN =[-] Y {+,-} X 

CLASS: DA-Multiply/Accumulate 

DAU FLAGS AFFECTED: NZVU 

DESCRIPTION 

In this instruction, the operand specified by the X field is added to or subtracted from the operand or minus the 


operand specified by the Y field and the result is stored in an accumulator (aN). If a Z field is specified, the result 
is also be stored to a word address in memory. The operation actually performed is [Z =] aN = [-] Y {+,-} 1.0° X. 














Instruction Replacement Table 


Meaning 
[aN _|[a0—a3__—_| One of the four DAU accumulators 


xo¥ *rP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 
“rP--, *rP++rl, rP refers to ri—14 and is used as a memory pointer. 


rl refers to r15—19 and is used as an increment register. 


One of the four DAU accumulators. 
LATENCY 





























32-bit memory location. ++ is +4 and -- is —4. 
rP refers to ri—14 and is used as a memory pointer. 
tl refers to r15—19 and is used as an increment register. 





For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 1) 


Bit 31—29 | 28-26 | __25 24 23 | 22-21 | 20—-14| 13~7 | 6-0 
Fed: [Goons [eaMict) 028 Reon) St) SN a) 8X 8 | eyes Bezier 


MField: — Specifies a constant value to be used for multiplier input (1.0). 

F Field: Specifies sign of operation for adder input. 

S Field: Specifies sign of operation for product. 

N Field: Specifies the accumulator used for result. 

X Field: Specifies the multiplier input via accumulator direct or register indirect 
addressing modes. 

Y Field: Specifies the adder input via accumulator direct or register indirect 
addressing modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


al = *r5-- - “r7++ 


Bit 31.30 29 | 28 27 26 | 25 | 24 | 23 | 2221 20—14 13—7 6—0 
Encoding [ OOF [ 101 | O;o] 1 | 07 [0111111]0101110]0000111 
Meaning [| 0) | [+[-J[ at [s+ JT *r5-TNowrite_| 
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FADD-TAP 


INSTRUCTION: aN = [-] (Z=Y) {+,-} X 

CLASS: DA-Multiply/Accumulate 

DAU FLAGS AFFECTED: NZVU 

DESCRIPTION 

In this instruction, the operand specified by the X field is added to or subtracted from the operand specified by the 


Y field. The result or minus the result is stored in an accumulator (aN). The value of the Y operand is stored to a 
word address in memory. The operation actually performed is: aN = [-] (Z=Y) {+,-} 1.0 * X. 






Instruction Replacement Table 


| _aN _|a0—a3__—_—_| One of the four DAU accumulators 


X,Y *rP, *rP++, 32-bit memory location. ++ is +4 and -- is -4. 


“rP--, *rP++rl, rP refers to ri—14 and is used as a memory pointer. 


rl refers to r15—19 and is used as an increment register. 
LATENCY 


































ad—a3 One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is —4. 
rP refers to ri—14 and is used as a memory pointer. 


rl refers to r15—19 and is used as an increment register. 


For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 4) 


Bit 31—29 | 28-26 | 25 24 23 | 2221 | 20—-14| 13~7 |_ 6-0 
Field. [ROOMS pain] v0. ner on eMS a8 | mane (aX Ss) ay =e [eZ S0 


F Field: Specifies sign of operation for adder input. 

S Field: Specifies sign of operation for product. 

N Field: Specifies the accumulator used for result. 

X Field: Specifies the multiplier input via accumulator direct or register indirect 
addressing modes. 

Y Field: Specifies the adder input via accumulator direct or register indirect 
addressing modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


al = (*r13 = a3) - *r1-- 










Bit 3130 29 | 28 27 26 | 25] 24] 23| 2221| 20-14 
Encoding [ 007 | 170 [O]O] 1] 01 [0001110]0000011}1101000 


Meaning [____[Formata] [+] -T[ ai | “3 


Bees Bee) 
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FLOAD-STORE 


INSTRUCTION: [Z=]aN=[-]Y 

CLASS: DA-Multiply/Accumulate 

DAU FLAGS AFFECTED: NZVU 

DESCRIPTION 

This instruction places the operand or minus the operand specified by the Y field into an accumulator (aN). If aZ 
field is specified, the result is also be stored to a word address in memory. The operation actually performed is 
[Z=] aN = [-] ¥ + 0.0* a0. Ifthe Y operand is not a valid DSP3210 floating-point number, it will be treated as a 


diry-zero and the accumulator will be loaded with zero. See Section 3.3.3 32-bit Floating-Point Data Type. To 
perform memory-to-memory transfers use FMULT-TAP. 











Instruction Replacement Table 


‘One of the four DAU accumulators 


Y “rP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 
*rP--, *rP++rl, rP refers to ri—14 and is used as a memory pointer. 
a0—a3 
Zz “rP, *rP++, 
rP++rl 
. 


rl refers to r15—19 and is used as an increment register. 
LATENCY 
For latency effects refer to Section 4.4.2. 






















One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is -4. 
rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 













INSTRUCTION FORMAT (Format 1) 
Bit 31—29 | 28—26 25 24 23 22—21 | 20—14| 13~7 | 6—0 


Fed [E00n [Mj 0/0 Rodjpesnal: Nuc} —X— [eevpezes] 


MField: — Specifies a constant value to be used for multiplier input (0.0). 

F Field: Specifies sign of operation for adder input. 

S Field: Specifies sign of operation for product (+). 

N Field: Specifies the accumulator used for result. 

Y Field: Specifies the adder input via accumulator direct or register indirect 
addressing modes. 

X Field: | The default specification is a0 as the multiplier input. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


*r1 = a2 = *r2-- 


0010110/0001000 
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FLOAT16 


INSTRUCTION: = [Z =] aN = float16(Y) 

CLASS: DA-Special Function 

DAU FLAGS AFFECTED: NZ,V=U=0 

DESCRIPTION 

This instruction converts the 16-bit 2's complement integer operand specified by the Y field to the equivalent 32-bit 
floating-point representation. The result is stored in an accumulator (aN). The integer operand Y can specify a 
16-bit memory location or an accumulator. When the post-increment or post-decrement addressing mode is used 


for the Y operand the value of the modifier is +2 or -2, respectively. Ifa Z field is specified, the result is also 
stored to a word address in memory. 















Instruction Replacement Table 


Replace [Value | Meaning, = 
*rP, *rP++, 16-bit memory location. ++ is +2 and -- is -2. 

*rP--, *rP++rl, rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is —4. 

rP refers to r1—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 


















a0—a3 
*rP, *rP++, 
“rP--,*rP++rl 





LATENCY 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 
Bit 31—27 26—23 22—21 20—14 13—7 6—0 


Field [Once ae G | ENS /10,0,0'0.0'0.0 


G Field: Specifies the special function instruction type (float16). 

NField: Specifies the accumulator used for result. 

Y Field: Specifies adder input via accumulator direct or register indirect addressing 
modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


“r14+115 = a3 = float16(*r2) 








Bit 26—23 
Encoding [01111 0000000] 0010000|0001001 
Meaning [____ | _float16 aa 
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FLOAT32 


INSTRUCTION: = [Z =] aN = float32(Y) 

CLASS: DA-Special Function 

DAU FLAGS AFFECTED: NZ,V=U=0 

DESCRIPTION 

This instruction converts the 32-bit integer operand Y to the equivalent 32-bit floating-point representation. The 


result is stored in the accumulator aN and may optionally be written to memory. The 32-bit integer operand Y is 
located in a 32-bit memory location. If a Z field is specified, the result is stored to a word address in memory. 















Instruction Replacement Table 


[aN | a0—a3 
“rP, *rP++, 32-bit memory location. ++ is +4 and -- is -4. 

*rP--, *rP++rl, rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is -4. 
rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 




















a0—a3 
“rP, *rP++, 
*rP--,*rP+4rl 





LATENCY 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 


Bit 31—27 26—23 22-21 20—14 13-7 6-0 
Field [OMT] SN [L0:0.0:0'010.0 


G Field: Specifies the special function instruction type (float32). 

N Field: Specifies the accumulator used for result. 

Y Field: Specifies adder input via accumulator direct or register indirect addressing 
modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 
*r5+4117 = a3 = float32(*r2) 

Bit 
Encoding 
Meaning “r5++r17 
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FMULT-ACC-STORE 


INSTRUCTION: [Z =] aN = [-] aM {+,-} Y*X 

CLASS: DA-Multiply/Accumulate 

DAU FLAGS AFFECTED: NZVU 

DESCRIPTION 

This instruction multiplies the operands specified by the X and Y fields. This product is then added to or 


subtracted from the contents or minus the contents of an accumulator (aM), and the result is stored in an 
accumulator (aN). If the Z field is specified, the result is also stored to a word address in memory. 








Instruction Replacement Table 


g 
‘One of the four DAU accumulators 


X,Y “rP, *rP++, 32-bit memory location. ++ is +4 and -- is -4. 
“rP--, *rP++rl, rP refers to r1i—14 and is used as a memory pointer. 
ad—a3 

Zz *rP, *rP++, 
rP--,*rP++rl 


tl refers to r15—19 and is used as an increment register. 
LATENCY 


























One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is —4. 
rP refers to ri—14 and is used as a memory pointer. 

tl refers to r15—19 and is used as an increment register. 









For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 3) 


Bit 31—29 | 28-26 | 25 24 23 | 2221 | 20-14| 13~7 |_6—0 
Ried: | [2one ar fay 0 Epon tN a XE ae 


M Field; Specifies the accumulator used for adder input. 

F Field: Specifies sign of operation for adder input. 

S Field: Specifies sign of operation for product. 

N Field: Specifies the accumulator used for result. 

X,Y Fields: Specifies the multiplier inputs via accumulator direct or register indirect 
addressingmodes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


“r++ = a0 = al —“r2-- * “1344117 


Bit 31.30 29 | 28 27 26 | 25] 24| 23] 2221] 20—14 137 6—0 
Encoding | 017 | 007 [ofTof{1{ 00 [0011011/0010110 
Pe a (a) (| a a 





FMULT-ACC-TAP 


INSTRUCTION: aN = [-] aM {+,-} (Z=Y) * X 
CLASS: DA-Multiply/Accumulate 

DAU FLAGS AFFECTED: NZVU 

DESCRIPTION 


This instruction multiplies the operands specified by the X and Y fields. The result of the multiplication (Y * X) is 
added to or subtracted from the contents of an accumulator (aM), and the result is stored in an accumulator (aN). 


The value of the Y field operand is stored to a word address in memory. 















Instruction Re| 


Replace g 
One of the four DAU accumulators 

*rP, *rP++, 32-bit memory location. ++ is +4 and -- is -4. 
*rP--, *rP++rl, rP refers to r1—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is —4. 
rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 





placement Table 













X,-¥. 


LATENCY 















a0—a3 
*eP, “rP++, 
“*rP--,*rP++rl 


For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 2) 


Bit 31—29 | 28-26 | __25 24 23 | 22-21 | 20—14| 13~7 |_6—0 
Field) [2070” eeME a) OSs |= Pes [Sos SNe ef eee 


MField: Specifies the accumulator used for adder input. 

F Field: Specifies sign of operation for adder input. 

S Field: Specifies sign of operation for product. 

N Field: Specifies the accumulator used for result. 

X,Y Fields: Specifies the multiplier inputs via accumulator direct or register indirect 
addressing modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


a0 =-al + (*r2-- = *13++r17) * “r++ 























Bit 3130 29 | 28 27 26 | 25 | 24 | 23 | 22.21 137 6-0 
Encoding 007 [0]1] 0] 00 [0001111/0011011/0010110 
Meaning [| at | {—[+ [a0 [rive [rdeert7@ fre 





FMULT-ADD-STORE 


INSTRUCTION:  [Z=] aN =[-] Y {+,-} aM* X 

CLASS: DA-Multiply/Accumulate 

DAU FLAGS AFFECTED: NZVU 

DESCRIPTION 

This instruction multiplies the operand specified by the X field with the contents of an accumulator (aM). This 
product is then added to or subtracted from the operand or minus the operand specified by the Y field and the 


result is stored in an accumulator (aN). If the Z field is specified, the result is also stored to a word address in 
memory. 


Instruction Replacement Table 


[aval __{aoa3_—_ ‘One of the four DAU accumulators 


“rP++, 32-bit memory location. ++ is +4 and -- is —4. 
“P--, “rP++rl, rP refers to r1—14 and is used as a memory pointer. 


tl refers to r15—19 and is used as an increment register. 
a0—a3 One of the four DAU accumulators. 

32-bit memory location. ++ is +4 and -- is —4. 

tP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 





LATENCY Ee 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT betas 1) 
Bit 31—29 | 28—26 23 22—21 | 20—14 | 13—7 | 6—0 


Field ci es SS 


MField: Specifies the accumulator used for multiplier input. 

F Field: Specifies sign of operation for adder input. 

S Field: Specifies sign of operation for product. 

N Field: Specifies the accumulator used for result. 

X Field: Specifies the multiplier input via accumulator direct or register indirect 
addressing modes. 

Y Field: Specifies the adder input via accumulator direct or register indirect 
addressing modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


*r2++116 = a2 = *r14++ - a0 * *r10-- 


Bit 31 30 29 | 282726 | 25] 24|23|2221| 20-14 13-7 
Encoding [ 001 | ooo [ofoti{ 10 [1010110]1110111] THOT 
CT i a ee a a (AE (TS Ta 





FMULT-STORE 


INSTRUCTION: [Z=]aN=[-]Y*X 
CLASS: DA-Multiply/Accumulate 

DAU FLAGS AFFECTED: NZVU 
DESCRIPTION 


This instruction multiplies the operands specified by the X and the Y fields. The product or minus the product is 
stored in an accumulator (aN). If the Z field is specified, the result is also stored to a word address in memory. 
The operation actually performed is [Z=] aN = 0.0 {+,-} Y * X 


Instruction Replacement Table 


| San. iatas_ 2S 


ey, *rP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 
“rP--, *rP++rl, rP refers to r1i—14 and is used as a memory pointer. 


rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 

32-bit memory location. ++ is +4 and -- is —4. 

rP refers to r1—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 





LATENCY 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 3) 


Bit 31—29 | 28-26 | 25 24 23 | 22-21 | 20-14 13~7 | 6-0 
iid [0 ae OE A Fete | ta ff IN Pea aye | ec 


MField: Specifies a constant value to be used for adder input (0.0). 

F Field: Specifies sign of operation for adder input (+). 

S Field: Specifies sign of operation for product. 

N Field: Specifies the accumulator used for result. 

X.Y Fields: Specifies the multiplier inputs via accumulator direct or register indirect 
addressing modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


*r2+4117 = a3 = “1124+ * *r11-- 


Bit 3130 29 | 28 27 26 | 25 | 24 | 23 | 2221 20—14 13—7 6—0 


Encoding [ 0177 | 100 |[O][O] Of 11 [1011110]1100111/0010011 
Meaning [Too T+ P+ fT as Prt ee re rt7_ | 





FMULT-TAP 


INSTRUCTION: aN =[-] (Z=Y) * X 

CLASS: DA-Mulitply/Accumulate 

DAU FLAGS AFFECTED: NZVU 

DESCRIPTION 

This instruction multiplies the operands specified by the X and Y fields. The product Y * X or minus the product is 
stored in an accumulator (aN). The value of the Y operand is stored to a word address in memory. The 


operation actually performed is: aN = 0.0 {+,-} (Z=Y) * X. This instruction can be used for 32-bit memory-to- 
memory transfers of arbitrary data, e.g. a0 = (Z=Y) * a0. 







Instruction Replacement Table 




















[aN _[a0—a3__—_—_—_—| One of the four DAU accumulators 
“rP, *rP++, 32-bit memory location. ++ is +4 and -- is -4. 
*rP--, *rP++rl, rP refers to r1—14 and is used as a memory pointer. 





rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is —4. 
rP refers to r1i—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 












LATENCY . 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 2) 


Bit 31—29 | 28-26 | 25 24 23 | 22-21 | 20-14| 13~7 
Fed) [aosiou[as Maat [fee peek Vee 


MField: Specifies a constant value to be used for adder input (0.0). 

F Field: Specifies sign of operation for adder input (+). 

S Field: Specifies sign of operation for product. 

N Field: Specifies the accumulator used for result. 

X,Y Fields: Specifies the multiplier inputs via accumulator direct or register indirect 
addressing modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 
aO = - (*r2++r16 = “r1) * *r8-- 






Bit 31.30 29 | 282726 | 25| 24] 23] 2221| 20-14 13-7 6—0 
Encoding [ 010 | 100 | O]|O] 7] 00 [1000110/000100010010010 
Meaning [| (0.0) | |+)-| ao [| re-  [*rt  [ “2++ri6 








GOTO 


INSTRUCTION: goto {N, M, rB, rB+N, rB+M} 
CLASS: CA-Control 

CAU FLAGS AFFECTED: None 

DESCRIPTION 


This instruction is an unconditional goto. When executed, the specified value in the braces {} is moved to the 
program counter (pc). The values in the braces {} specify either an address N (16-bit MSB extended to 32-bits), 
an address M (24-bit zero-extended to 32-bits), a CAU register rB, a value in a CAU register rB plus a 
displacement value N (16-bit MSB extended to 32-bits), or a value in a CAU register rB plus a displacement value 
M (24-bit zero extended to 32-bits). N is always 16 bits and is sign-extended to 32 bits for all operations. Mis 
always 24-bits and it zero-extended to 32-bits. The goto M instruction is a format 8a instruction. The remaining 
instructions are encoded as format 0-1b, special cases of the if (COND) goto instruction. In this case, the branch 
is always taken. When using pc as rB, the syxtax: 

pcgoto label 
may be used to perform a pc-relative branch. The linker will compute the appropriate N (offset) value. This 
instruction disables interrupts for one instruction cycle. 


Instruction Replacement Table 
Replacement 


pe, ro—r22 The program counter, r0 (hard-wired zero), or one of 
22 general-purpose registers. 
Nes 






16-bit number | 2's complement integer 
24-bit number. | Unsigned integer 






LATENCY 


When the above instruction is executed, the instruction immediately following is also executed before the branch 
occurs. Normally, a nop is placed after the branch instruction. For details on latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 0—1b) 


Bit 31-27 26—21 20—16 
Field fooga [ee 


C Field: Specifies the condition (true). 

1B Field: | Specifies the base address register. When N specifies 0, rB is the 
destination address. 

N Field: Specifies a 16-bit 2's complement integer used as an offset value for the 
base address. When rB specifies r0, the N field is an absolute address. 


INSTRUCTION FORMAT (Format 8a) 


Bit 31-29 28-21 20—16 15—0 
Field a i ee FE a ee ea ees) 


rB Field: Specifies the base address register. 

NE Field: Specifies the most significant 8-bits of the 24-bit integer. NE concatonated 
with N forms the 24-bit unsigned integer (NE || N). 

N Field: Specifies the least significant 16-bits of the 24-bit unsigned integer. 





GOTO 


INSTRUCTION (cont.): goto {N, M, rB, rB+N} 
EXAMPLE (For field encodings refer to Chapter 10.) 


goto r1 
Bit 31—27 26—21 20—16 


15—0 
Encoding 70000 | 000001 | 00007 |0000000000000000 
Meaning ae [eer an ame (remem ee ay) 0x0000 





GOTO-COND 


INSTRUCTION: __ if(COND) goto {N, rB, rB+N} 
CLASS: CA-Control 

CAU FLAGS AFFECTED: None 

DESCRIPTION 


This instruction is a conditional branch instruction that depends on the values of the flags. The condition (CAU 
COND, DAU COND, or IO COND) is selected from the tables below. CAU COND flags are established by the 
previous CA instruction that set flags. DAU COND flags are established by the most recent DA instruction that 
occurs four or more instructions prior to the condition test. IO COND flags can be affected by external events, 
and program or DMA activity. If the condition evaluates true, the value of the program counter (pc) is replaced 
with one of the values specified in the braces {}. These values include a CAU register rB, an address N (MSB 
extended), or a value in a CAU register rB plus a displacement value N (MSB extended). N is a 16-bit 2's 
complement number and is MSB-extended to 32 bits. When using pc as rB, the syxtax: 

if (COND) pcgoto label 
may be used to perform a pe-relative branch. The linker will compute the appropriate N (offset) value. This 
instruction disables interrupts for one instruction cycle. 


Replacement Table for CA Instructions 
COND) - CAU Conditions 


Condition | Flagt _| 
false Condition is always false 
true Not applicable _| Condition is always true 

Result is nonnegative (plus) 

Result is negative (minus' 


ee Result not equal to zero 


g 
Result equal to zero 
ve ‘Overflow clear, no overflow 
vs v=1 Overflow set, overflowed 
Carry clear, no carry 
c=1 Carry set, car 
Greater than or equal to zero 
nave Less than zero 
Greater than zero 
Z| (n4v)= Less than or equal to zero 
i ae as Greater than (unsigned number) 
ol Z=1 Less than (unsigned number} 


t Symbol interpretation: 4 = XOR; | OR 







° 
N 





Replacement Table for CA Instructions 
(COND) - DAU Conditions 
lp ee 
Not equal to zero 
Equal to zero 
Greater than or equal to zero 
Less than zero 
Overflow clear, no overflow 
Overflow set, overflowed 
Underflow clear, no underflow 
Underflow set, underflowed 
Greater than zero 
Less than or equal to zero 


Condition 


age 





ave 


ie 
Ts A he 


agt 
ale 


2 
ty 
zz 
se Cle < 
DA SIE SIA OI” oe 
=“ 





GOTO-COND 


INSTRUCTION (cont.): —_ if(COND) goto {N, rB, rB+N} 









Replacement Table for CA Instructions 
(COND) - 10 Conditions 


[Condition [Flag__-| SCM’ SS 
if bf=1 Input buffer full 

ae 
obf obe=0 Output buffer full 

syc S| Sync signal low 
ee 

fos fb=1 Serial frame boundary set 

irOn=1 IRON pin is logic one (1). 

ir aac irOn=0 IRON pin is logic zero (0). 

irte irtn=0 IR1N pin is logic zero (0). 
Replacement [| Meaning 


B pe, roO—r22 The program counter, r0 (hard-wired zero), or one of 
22 general-purpose registers. 
[___N____|__16-bitnumber _| 2's complement integer 






















LATENCY 


When the above instruction is executed, the instruction immediately following is also executed before the branch 
occurs. A DAU condition tested by a conditional branch or conditional arithmetic/logic instruction is established by 
the last DA instruction that affects DAU flags no sooner than four instructions prior to the test. For details on 
latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 0-1b) 
Bit 31-27 26—21 20—16 15—0 
Field 1000 0s Cereal eearI Bat aah aa —  sNezs | 


C Field: Specifies the condition. 

rB Field: Specifies the base address register. When N specifies 0, rB is the 
destination address. 

N Field: Specifies a 16-bit 2's complement integer used as an offset value for the 
base address. When rB specifies r0, the N field is an absolute address. 


EXAMPLE (For field encodings refer to Chapter 10.) 
if(pl) goto r1+0x0800 

Bit 
Encoding 
Meaning 











GOTO-LOOP 


INSTRUCTION: _ if(rM—>=0) goto {N, rB, rB+N} 
CLASS: CA-Control 

CAU FLAGS AFFECTED: None 

DESCRIPTION 


This instruction is primarily used as a loop counter. The value of rM is treated as a 32-bit number and is 
decremented each time the instruction is executed. If the value of rM is greater than or equal to zero, the value of 
the pe changes to one of the specified locations in the braces {}. If Mis less than zero, the next instruction is 
executed. The values in the braces {} specify a CAU register rB, an address N (MSB extended) or a value in a 
CAU register rB plus a displacement value N (MSB extended). N is always 16 bits and is sign-extended to 32 
bits for all operations. When using pc as rB, the syxtax: 

if ((M- >=0) pcgoto label 
may be used to perform a pc-relative branch. The linker will compute the appropriate N (offset) value. This 
instruction disables interrupts for one instruction cycle. 


Instruction Replacement Table 


pe, roO—r22 The program counter, r0 (hard-wired zero), or one of 
22 general-purpose registers. 
fae ; 
aa i 








One of 22 CAU registers. 
16-bit number | 2's complement integer 


LATENCY . 





When the above instruction is executed, the instruction immediately following is also executed before the branch 
occurs. For details on latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 3a) 


Bit 31—26 25-21 20—16 
Field 0.0:0.0°-1-J— ma JB] 


tM Field: Specifies the register used for the loop counter. 

rB Field: — Specifies the base address register. When N specifies 0, rB is the 
destination address. 

N Field: Specifies a 16-bit 2's complement integer used as an offset value for the 
base address. When rB specifies r0, the N field is an absolute address. 


EXAMPLE (For field encodings refer to Chapter 10.) 


if(ri— >=0) goto r2 - Ox00ab 


Bit 31—26 25—21 20—16 15—0 
Encoding 000011 | 00001 00010 |17111111101010101 
Meaning jf rt J re Oxf55 





Ic 


INSTRUCTION: = [Z =] aN = ic(Y) 
CLASS: DA-Special Function 

DAU FLAGS AFFECTED: NZ,V=U=0 
DESCRIPTION 


This instruction converts the operand specified by the Y field, an 8-bit .-law, A-law, or unsigned word (depending 
on the value of the dauc register), to an equivalent 32-bit floating-point number. The result is stored in an 
accumulator (aN). The Y operand can specify a byte memory location or an accumulator. A detailed description 
of data formats is presented in 8.2.4 - Type Conversions. If Y specifies an accumulator, the 8-bit word must 
reside in the most significant byte of the accumulator. If a Z field is specified, the result is also stored to a word 
address in memory. 












Instruction Replacement Table 


[aN _|a0Q—a3__—_| One of the four DAU accumulators 
NY! 
= 


*rP, *rP++, 8-bit memory location. ++ is +1 and -- is -1. 
LATENCY 














*rP--, *rP++rl, rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators 

32-bit memory location. ++ is +4 and -- is —4. 

rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 












a0—a3 
“TP, “P+, 
*rP--,*rP++rl 








For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 


Bit 31-27 26—23 22-21 20—14 13-7 6-0 
Field Loiiii [| Gf N  [oo00000 


G Field: Specifies the special function instruction type (ic). 

N Field: Specifies the accumulator used for result. 

Y Field: Specifies adder input via accumulator direct or register indirect addressing 
modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 





EXAMPLE (For field encodings refer to Chapter 10.) 


“r++ = a0 = ic(*r2++) 


















Bit 3127 26—23 22.21 
Encoding 0000 | 00 [0000000] 0010111/0001111 . 
ae [eee ee Ns | Cg me fee [ee oon | emer 
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IEEE 


INSTRUCTION: = [Z =] aN = ieee(Y) 
CLASS: DA-Special Function 

DAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction converts the Y operand, a DSP32C format floating-point number, to an IEEE format, floating-point 
number. The Y operand can specify a memory location or an accumulator. If the exponent of the DSP32C 
number is zero, the converted IEEE number is 0x00000000 or positive zero. DSP32C to IEEE conversion never 
results in NaN (not a number). If a Z field is specified, the result is also stored to a word address in memory. In 
order to get the IEEE number correctly formatted in memory, the Z write must be performed in the instruction that 
specifies the ieee conversion. 
















Instruction Replacement Table 


Replace 
‘One of the four DAU accumulators 


“rP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 

“rP--, *rP++rl, rP refers to ri—14 and is used as a memory pointer. 

tl refers to r15—19 and is used as an increment register. 
ad—a3 One of the four DAU accumulators. 
*rP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 
*rP--,*rP++rl rP refers to r1i—14 and is used as a memory pointer. 

tl refers to r15—19 and is used as an increment register. 














LATENCY 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 
Bit Si—27. 26—23 22—21 20—14 13—7 6—0 


Field ESR Sc Sas Bae ES i ee es a 


G Field: Specifies the special function instruction type (ieee). 

N Field: Specifies the accumulator used for result. 

Y Field: Specifies adder input via accumulator direct or register indirect addressing modes. 
Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 





*r4-- = al = ieee(*r9++) 


Bit 31-27 26—23 2221 20—14 13—7 6—0 
Encoding 0000000] 1001111/0100110 
Meshing [ei saens) aloes | aaianwjes eat ees fy re | 
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IFAEQ 


INSTRUCTION: = [Z =] aN = ifaeq(Y) 
CLASS: DA-Special Function 

DAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction stores the Y operand in accumulator aN only if the result of the most recent DA instruction is 
equal to zero. If itis not zero, aN will retain its old value. Regardless, the Y address calculations are performed 
(register postmodification is performed if specified). The Y operand can specify a word memory location or an 
accumulator. Based on dauc{6], the Z-field operation can be conditional. If dauc[6]=0, the Z-field operation, if 
specified, is not conditional. Note that if the test fails, the previous value of aN is stored to memory. If 

dauc[6] = 1, the write to memory, if specified, is also conditional, but the address postmodifications are always 
performed. If written, the value of aN is stored to a word address in memory. 









Instruction Replacement Table 


32-bit memory location. ++ is +4 and -- is —4. 

rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is -4, 
rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 



















LATENCY 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 


Bit 31—27 26—23 22—21 20—14 13—7 6—0 
Field ES a (a DS OAT ae eee (a 
G Field: Specifies the special function instruction type (ifaeq). 
NField: Specifies the accumulator used for result. 
Y Field: Specifies adder input via accumulator direct or register indirect addressing modes. 
Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


a0 = ifaeq(*r7) 















Bit 31-27 26—23 2221 
Encoding [_01111 | O110 [| 00 [oo00000/0111000}0000111 
Pe Fae eS (75 Sl ST) [R(T 
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IFAGT 


INSTRUCTION: [Z =] aN = ifagt(Y) 
CLASS: DA-Special Function 

DAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction stores the Y operand in accumulator aN if and only if the result of the most recent DA instruction 
is greater than zero. If itis not greater than zero, aN retains its old value. Regardless, the Y address calculations 
are performed (register postmodification is performed if specified). The Y operand can specify a word memory 
location or an accumulator. Based on dauc(6], the Z-field operation can be conditional. If dauc[6] = 0, the Z-field 
operation, if specified, is not conditional. Note that if the test fails, the previous value of aN is stored to memory. 
If dauc[6] = 1, the write to memory, if specified, is also conditional, but the address postmodifications are always 
performed. If written, the value of aN is stored to a word address in memory. 












Instruction Replacement Table 


Replace 


| _aN _|a0—a3__—_| One of the four DAU accumulators 
32-bit memory location. ++ is +4 and -- is —4. 
rs *rP, *rP++, 
*rP--,"rP++rt 
. 


rP refers to r1—14 and is used as a memory pointer. 
LATENCY 










rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is -4. 
rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 
















For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 
Bit 31—27 26—23 22—21 20—14 13—7 


6—0 
Field [Loiiii [  G  [ N  [oo00000 


G Field: Specifies the special function instruction type (ifagt). 

N Field: Specifies the accumulator used for result. 

Y Field: Specifies adder input via accumulator direct or register indirect addressing 
modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


“113 = al = ifagt(*r9++r18) 


Bit 31-27 26—23 2221 20—14 13—7 6—0 
Encoding [01711 | 07177 [| 01 [0000000] 100110011101000 
Meaning) [ate fh tag = [ata ee | eerie Psy 
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IFALT 


INSTRUCTION: [Z =] aN = ifalt(Y) 
CLASS: DA-Special Function 

DAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction stores the Y operand in an accumulator (aN) only if the result of the most recent DA instruction is 
negative. If it is not negative, aN retains its old value. Regardless, the Y address calculations are performed 
(register postmodification is performed if specified). The Y operand can specify a word memory location or an 
accumulator. Based on dauc{6], the Z-field operation can be conditional. If dauc[6] = 0, the Z-field operation, if 
specified, is not conditional. Note that if the test fails, the previous value of aN is stored to memory. If 

dauc{6] = 1, the write to memory, if specified, is also conditional, but the address postmodifications are always 
performed. If written, the value of aN is stored to a word address in memory. 
















Instruction Replacement Table 


[aN | a0—a3 
*rP, *rP++, 32-bit memory location. ++ is +4 and -- is -4. 

“rP--, *rP++rl, rP refers to r1—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is -4. 
rP refers to ri—14 and is used as a memory pointer. 

rl refefs to r15—19 and is used as an increment register. 






















a0—a3 
“rP, *rP++, 
*rP--,*rP++rl 


LATENCY 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 


Bit 31—27 26—23 22-21 20—14 13-7 6—0 
Field Loiiii [ GN [0000000 


GField: Specifies the special function instruction type (ifalt). 

N Field: Specifies the accumulator used for result. 

Y Field: Specifies adder input via accumulator direct or register indirect addressing 
modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 





EXAMPLE (For field encodings refer to Chapter 10.) 
*r11-- = a2 = ifalt(*r7--) 









Bit 31—27 26—23 2221 20—14 13—7 6-0 
Encoding [07717 | O01 [70 [ooooooo/0111110{1011110 
Meaniig) — [= Se atone oe a [ee eee eet 
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INCR/DECR 


INSTRUCTION: _ [if (COND)]rD1 = {{(long)],(short)} rS {4-31 (1) 
{if (COND)] sp = [(long)] sp {++,--} (2) 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nzve 
DESCRIPTION 


(1) This instruction increments or decrements the contents of register rS by 1 and places the result in register rD1 
(can not be the stack pointer). The items in the brackets [] are optional. A 32-bit operation, (long), is the default 
size and does not have to be specified. (short) indicates that a 16-bit operation is desired. For the (short) 
operation, the result is sign extended from bit 15 to bit 31. Flags are set according to the 16- or 32-bit operation 
flag rules (see Table 4-1), as specified by the instruction. If the instruction is conditionally executed and the 
COND is true, the result is placed in the register rD, and the flags are set. If the instruction is conditionally 
executed and the COND is false, neither the register rD nor the flags are affected. The COND field is the same 
as for the add operations (see page 4-21 ADD). 


(2) This instruction increments or decrements the contents of the register sp (stack pointer) by 4. The source 
register and the destination register are sp. The instruction can only be executed as 32-bits. This instruction 
disables interrupts for one instruction cycle. If the instruction is conditionally executed and the COND is true, 
the result is placed in the register sp, and the flags are set. If the instruction is conditionally executed and the 
COND is false, neither the register sp nor the flags are affected. The COND field is the same as for the add 
operations (see page 4-21 ADD). 


Instruction Replacement Table 
Term[ Replacement [Meaning 


[ ri | __r0—r20,r22 _—| r0 (hard-wired zero), or one of 21 CAU registers 
TO (hard-wired zero), or one of 22 CAU registers 


INSTRUCTION FORMAT (1-2) (Format 6a—b) 







Bit 31 30—25 24—21 20—16 15—11 10—5 4-0 
Field Een Eh ee Oe ee eS 
E Field: Specifies the size (16/32) of the ALU operation. 
rD Field: Specifies the destination register. 
rS1,rS2 Fields: Specifies the source register. 
F Field: Specifies the ALU function to be performed (+). 
C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 


if(vs) 5 =15 +1 


















Bit 30—25 24—21 | 20-16 | 15-11 10—5 4-0 
Encoding [0 | 001100 0000 00101 [000111| 10111 
Meaning a 15 5 
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INT16 


INSTRUCTION: [Z =] aN = int16(Y) 
CLASS: DA-Special Function 

DAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction converts the floating-point operand Y to a 16-bit 2's complement integer. The conversion can be 
done using one of three modes specified by the dauc register bits 4 and 5: round to nearest (dauc[5—4] = 00), 
truncate towards -e> (dauc[5—4] = 10), or truncate towards zero (dauc[5—4] = 11). The default (on reset) value of 
dauc[5—4] = 00. The Y operand can specify an accumulator or a float memory location. The result is stored in 
the upper 16 bits of an accumulator (aN). The lower portion of aN contains unpredictable data. If the Y operand 
exceeds the range of allowable 16-bit integers (-2'5, 215 — 1), the result is a saturated 16-bit integer. If a Z field 
is specified, the result is stored to a half-word address in memory. In order to write the 16-bit result to memory as 
a 16-bit write, the Z-write must be performed in this instruction. When the post-increment or the post-decrement 
addressing mode is used for the Z operand, the value of the modifier is +2 or -2, respectively. 













Instruction Replacement Table 


[res RephaC8, =| cy NANOS done ns os ween N@ONINY ts ar a 
[aN _[ad—a3__—_—[ One of the four DAU accumulators 
Y 
a0—a3 
“rP++, 


*rP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 
|S fete | 


“rP--, *rP++rl, rP refers to r1—14 and is used as a memory pointer. 
LATENCY 














rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 
16-bit memory location. ++ is +2 and -- is -2. 
rP refers to ri—14 and is used as a memory pointer. 

tl refers to r15—19 and is used as an increment register. 



















For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 


Bit 31—27 26—23 22-21 20—14 13-7 6-0 
Field 5 ee eS a EE I Ss 


G Field: Specifies the special function instruction type (int16). 

N Field: Specifies the accumulator used for result. 

Y Field: Specifies adder input via accumulator direct or register indirect addressing modes. 
Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


*r3 = a0 = int16(a0) 


Bit 31-27 26—23 2221 20—14 13—7 6—0 
Encoding O1ii1 [| 00171 [ 00 [0000000] 0000000/0011000 
DT a Sal (a(S SS) [a RR a cc 
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INT32 


INSTRUCTION: —[Z =] aN = int32(Y) 
CLASS: DA-Special Function 

DAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction converts the floating-point operand Y to a 32-bit 2's complement integer. The conversion can be 
done using one of three modes specified by the dauc register bits 4 and 5: round to nearest (dauc[5—4] = 00), 
truncate towards -e> (dauc[5—4] = 10), or truncate towards zero (dauc[5—4] = 11). The default (on reset) value 
of dauc[5—4] = 00. The Y operand can specify an accumulator or a 32-bit memory location. The result is stored 
in the 32 MSBs of an accumulator (aN), the mantissa, and guard bits. The remaining bits of aN contain 
unpredictable data. If the Y operand exceeds the range of allowable 32-bit integers (-231 , 231 — 1), the result is 
a saturated 32-bit integer. If a Z field is specified, the result is stored to a word address in memory. In order to 
get the result correctly formatted in memory, the Z-write must be performed in this instruction. 












Instruction Replacement Table 


Meaning 
[aN _|a0—a3__—_—_| One of the four DAU accumulators 


*rP, *rP++, 32-bit memory location. ++ is +4 and -- is -4. 
LATENCY 



















*rP--, *rP++rl, rP refers to ri—14 and is used as a memory pointer. 
rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 
32-bit memory location. ++ is +4 and -- is —4. 
rP refers to ri—14 and is used as a memory pointer. 
rl refers to r15—19 and is used as an increment register. 







ad—a3 
*rP, *rP++, 
*rP--,"rP++r1 


For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 


Bit 31-27 26—23 22-21 20-14 13-7 6-0 
Field Ri Cen ee OO a a 


G Field: Specifies the special function instruction type (int32). 

N Field: Specifies the accumulator used for result. 

Y Field: Specifies adder input via accumulator direct or register indirect addressing 
modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 





EXAMPLE (For field encodings refer to Chapter 10.) 


"124+ = a2 = int32(*r1++) 




















Bit 31—27 2221 20-14 
Encoding [01117 | 1011 | 10 [0000000] 0001111 7100111 
Ph fae aa) a Ge (ee ES ap 
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IRETURN 


INSTRUCTION: __ ireturn 

CLASS: CA-Control 

CAU FLAGS AFFECTED: Restored from interrupt shadow register 
DAU FLAGS AFFECTED: Restored from interrupt shadow register 
DESCRIPTION 


This instruction performs a return-from-interrupt operation. This operation consists of returning DSP3210 control 
to the point of interruption and restoring the internal registers from the shadow registers. This instruction is 
encoded and executes as if(true) goto pesh. 


LATENCY 


Unlike other CA control group instructions, the ireturn instruction does not have to be followed by a nop, since the 
instruction following the ireturn in memory is not executed. The instruction that was fetched prior to the interrupt is 
executed in its place. For details on latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Formats 0b and 1b) 
Bit 31-27 26-21 20—16 15—0 
Field STOIC [eS | Pee be NEE] 


C Field: Specifies the condition (true). 

1B Field: | Specifies the base address register (pcsh). 

N Field: Specifies a 16-bit 2's complement integer used as an offset value for the 
base address (zero). 


EXAMPLE (For field encodings refer to Chapter 10.) 


ireturn 


Bit 31—27 26—21 20—16 15—0 
Encoding 10000 000001 | 11110 [0000000000000000 
Meaning [te fT pesh_ 0x0000 
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LOAD 


INSTRUCTION: rD = {(char),(byte),(hbyte), (short), (ushort),[(long)]} MEM (1) 
rD = {(char),(byte),(hbyte),(short),(ushort),[(long)]} *L (2) 
rD = {(char),(byte),(hbyte), (short), (ushort),[(long)]} iorS (3) 
CLASS: CA-Data Move 


CAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction moves data from a memory location or IO register specified on the right-hand side to a register rD. 
The default size of the transfer is (long) and does not have to be specified. The size of the transfer and the CAU 
register bits affected by the transfer are specified by the key words: (char), (byte), (byte), (short), (ushort), and 
(long). (char), (byte) and (hbyte) are 8-bit transfers. (char) loads bits 7—0 with the data and sign extends bit 7 to 
bit 31. (byte) loads bits 7—0 with the data and zero extends to bit 31. (hbyte) loads bits 15—8 with the data, and 
zeros bits 7—0 and 31—16. (short) and (ushort) are 16-bit transfers. (short) loads bits 15—0 with the data and 
sign extends to bit 31. (ushort) loads bits 15—0 with the data and zero extends to bit 31. (long) is a 32-bit 
transfer that loads bits 31—0 with the data (see below). This instruction disables interrupts for one 
instruction cycle. 


(1) This instruction moves data from a memory location addressed using the register indirect addressing mode to 
the register rD. Data can be referenced as 8-, 16-, or 32-bits. Post-incrementation or post-decrementation is a 
function of the data size (8-bit, +1; 16-bit, +2; 32-bit, +4). 


(2) This instruction moves data from a memory location addressed using the direct addressing mode (“L) to the 
register rD. L is a 16-bit unsigned integer that is concatenated with the upper 16-bits representing the location of 
on-chip memory resources (0x0000 or 0x5003, based on pcw(10]) to form the address. 


(3) This instruction moves data from an IO register to the register rD. The size of the selected 1O register and the 
transfer size must be the same. 


Destination CAU Register Bits Affected 
Description 
ATA 
Zero-fill bits 7—0, zero-extend upper 


16-bits 
DATA 
DATA | DATA Load bits 31—0. 

















Instruction Replacement Table 
















*rP, *rP++, *rP--, | 8-, 16-, or 32-bit memory location. ++ and -- are a function of 
“rP+4rl 
rl is an increment register (rl = rO—r22). 


the data size. 
iorS [| _ior0—ior15 _| One of the 10 registers. 


Replacement 
rO (hard-wired zero), or one of 22 CAU registers 
rP is amemory pointer (rP = ro—r22); when using the sp (r21), 
the size of the transfer must be long. 
16-bit number _ | Unsigned integer - used as a direct address 
RESTRICTION 








The CAU register loaded and the flags set as a result of the load cannot be referenced in the following instruction 
(See Section 4.4.1.3, Restriction 3— CAU and IO Register Load). 
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LOAD 


INSTRUCTION (cont.): rD = {(char),(byte),(hbyte), (short), (ushort),[(long)]} MEM 
rD = {(char),(byte),(hbyte), (short), (ushort),[(long)]}} *L 
rD = {(char),(byte),(hbyte), (short), (ushort),[(long)]}_ iorS 

INSTRUCTION FORMAT (1) (Format 7c) 


Bit 31| 30—26 | 25] 24 | 23-21| 20-16 | 15—11 | 10| 9—5 4-0 
Field [aif ootitf}ol| tr] w {| m [ w [ofoooooy a | 


INSTRUCTION FORMAT (2) (Format 7a) 


Bit 31| 30-26 | 25| 24 | 23-21| 20-16 15—0 
Flokd qaqa: [r30.0ctsta1 sas) cue [umm Win [is ot coe s 


INSTRUCTION FORMAT (3) (Format 7b) 


Bit 31| 30-26 | 25] 24 | 23~-21| 20-16 |_ 15-11 | 10] 9—5 4-0 
Field [if oo117;ovT Tt {[ w [ nm [ooo00 oooo0 | ior | 


T Field: Specifies the direction of the transfer (to a register). 

W Field: — Specifies the size of the transfer. 

tH Field: Specifies the source/destination CAU register (destination). 
rP Field: Specifies the memory pointer register. 

tl Field: Specifies the increment register. 

L Field: Specifies a 16-bit unsigned integer. 

iorH Field: Specifies the source/destination IO register (source) 


EXAMPLE (For field encodings refer to Chapter 10.) 


110 = (ushort) *Oxaa 


Bit 31 | 30-26 | 25| 24 | 23~21| 20-16 15—0 
Encoding [0 [00711] Of 0 | 010 [o71010] _0000000010101010 
Meaning [_[ |__| (to) [(ushor)[ rio OxO0aa 
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LOAD-IOR 


INSTRUCTION: __ iorD = {(byte),(short),[(long)]} MEM 
CLASS: CA-Data Move 

CAU FLAGS AFFECTED: None 

DESCRIPTION 


This instruction moves data from a memory location to an IO register. The default size of the transfer is (long) 
and does not have to be specified. The size of the transfer and the IO register bits affected by the transfer are 
specified by the key words: (byte), (short), and (long). (byte) is an 8-bit transfer that loads bits 7—0 with the data 
and does not affect bits 8 to bit 31. (short) is a 16-bit transfer that loads bits 15—0 with the data and does not 
affect bits 16 to 31. (long) is a 32-bit transfer that loads bits 31—0 with the data (see below). This instruction 
disables interrupts for one instruction cycle. 


Destination 10 Register Bits Affected 


DATA | Load bits 7—0 (X means not affected) 


DATA | Load bits 15—0 























Instruction Replacement Table 
Replacement [Meaning 
“rP, *rP++, *rP--, | 8-, 16-, or 32-bit memory location. ++ and -- are a function of the 
*rP++rl data size. 

rP is a memory pointer (rP = r0—r22); when using the sp (r21), 
the size of the transfer must be long. 
rlis an increment register (rl = rO—r22). 
One of the IO registers 














[ior] ioro—iort5_| 


RESTRICTION 





The IO register loaded cannot be referenced in the following instruction (See Section 4.4.1.3, Restriction 3- CAU 
and IO Register Load). A dauc load from memory must be followed by a CA instruction. 


INSTRUCTION FORMAT (Format 7d) 


Bit 31| 30-26 | 25| 24 | 23—-21| 20-16 | 15-11 | 10] 95 4-0 
Feld [if ootit[i{ tf wy ion Tm [ofooooof A | 


T Field: Specifies the direction of the transfer (to a register). 

W Field: Specifies the size of the transfer. 

tP Field: Specifies the memory pointer register. 

rl Field: Specifies the increment register. 

iorH Field: Specifies the source/destination IO register (destination). 


EXAMPLE (For field encodings refer to Chapter 10.) 


ior8 = (short) *r2 


Bit 31| 30—26 | 25| 24 | 23-21] 20-16 | 15-11 | 10] 9-5 4-0 
Encoding [7 [ 00717[ 17] 0 | 077 [01000] 00010] 0 | 00000] 00000 
Meaning [| —s| | to | (short) [ore [2] TT 





NOP 


INSTRUCTION: nop 
CLASS: CA-Control 

CAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction is a no-op instruction. This instruction causes execution of the following instruction to be delayed 
by one processor cycle. This is a special case of the if(COND) goto instruction in which the branch is never 
taken. 


INSTRUCTION FORMAT (Formats 0b and 1b) 


Bit 31—27 26—21 20—16 15—0 
Field [=1:00 000 |= anGatees|eeeatBiae e| ees NO | 
C Field: Specifies the condition (false). 
rB Field: Specifies the base address register (r0). 
N Field: Specifies a 16-bit 2's complement integer used as an offset value for the 
base address (zero). 


EXAMPLE (For field encodings refer to Chapter 10.) 












nop 

Bit 31—27 26—21 20—16 15—0 

Encoding 10000 000000 00Q00 0000000000000000 
Meaning [| fase [ro 0x0000 





oc 


INSTRUCTION: = [Z =] aN = oc(Y) 
CLASS: DA-Special Function 

DAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction converts the floating-point operand specified by the Y field to an 8-bit 1-law, A-law, or unsigned 
linear value. The data type is specified by dauc register bits 3 and 1: p-law (dauc[3,1]=0,1), A-law 

(dauc[3,1] = 0,0), unsigned (dauc[3,1] = 1,x). Conversion to unsigned can be done using one of three modes 
specified by the dauc register bits 4 and 5: round to nearest (dauc[5—4] = 00), truncate towards -co 

(dauc[5—4] = 10), or truncate towards zero (dauc[5—4] = 11). The default (on reset) value of the dauc register is 
0.The Y operand can be a float memory location or an accumulator. The result is stored in the most significant 
byte of an accumulator (aN). The lower portion of the accumulator is written with unpredictable data. If the value 
of Y is outside the range of allowable y-law/A-law/8-bit unsigned numbers, the limiting 1-law/A-law/8-bit unsigned 
value is returned. For more information see 8.2.4 Type Conversions. No flags are set if this saturation occurs. If 
aZ field is specified, the result is stored to a byte address in memory. When the post-increment or post- 
decrement addressing mode is used for the Z operand, the value of the modifier is +1 or —1, respectively. In order 
to execute the store as 8-bits, the Z-write must be performed in this instruction. 


Instruction Replacement Table 


| Replace _|._ —_-Value\= _ si] eit inerecaaer) mecsuiMeaning: I 
[aN _[ad0—a3__—_—‘| One of the four DAU accumulators 


x. “rP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 
*rP--, *rP++rl, ¢P refers to ri—14 and is used as a memory pointer. 


rl refers to r15—19 and is used as an increment register. 
One of the four DAU accumulators. 

8-bit memory location. ++ is +1 and -- is -1. 

rP refers to ri—14 and is used as a memory pointer. 

rl refers to r15—19 and is used as an increment register. 





LATENCY 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 


Bit 31—27 26—23 22-2) 20—14 13—7 6—0 
Field {ESR ig fk | ees N=] (010: 0 1030: 010 |] ee eas | eee 
G Field: Specifies the special function instruction type (oc). 
N Field: Specifies the accumulator used for result. 
Y Field: Specifies adder input via accumulator direct or register indirect addressing modes. 
Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


at = oc(a1) 

Bit 31-27 26—23 2221 20—14 137 6—0 
Encoding 01717 | 0001 | 01 |0000000[0000001| 0000111 
Ley cme] SRST a (ay | (Ee eel ae 





OR 


INSTRUCTION: —rD ={[(long)],(short)} rD | N (1) 
[if(COND)] rD = {{(long)],(short)} rS1|rS2_ (2) 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nz,v=c=0 
DESCRIPTION 


These instructions perform bit-by-bit, logical OR operations. The items in brackets [] are optional. A 32-bit 
operation, (long), is the default size and does not have to be specified. (short) indicates that a 16-bit operation is 
desired. Flags are set according to the 16- or 32-bit operation flag rules (see Table 4-1), as specified by the 
instruction. 

(1) This instruction performs a bit-by-bit, logical OR operation on the contents of register rD with the value N and 
the result is placed in the register rD. N is a 16-bit 2's complement number that is MSB-extended to 32 bits. 


(2) This instruction performs a bit-by-bit, logical OR operation on the contents of register rS1 with the contents of 
register rS2, and the result is placed in the register rD. If the instruction is conditionally executed and the COND 
is true, the result of the operation on rS1 and rS2 is placed in the register rD, and the flags are set. If the 
instruction is conditionally executed and the COND is false, neither the register rD nor the flags are affected. The 
COND field is the same as for the add operations (see page 4-21 ADD). 


Tem [Replacement [Meaning 
[| —120 | (hard-wired zero), or one of 22 CAU registers =| 
Si, 182 
LN [teitnumber [2's complementinteger 


INSTRUCTION FORMAT (1) (Format 6c—d) 


Bit Sf 30—25 24—21 | 20-16 15—0 
Feld |_2 eS momrnodel eee) Parapet Ne 


INSTRUCTION FORMAT (2) (Format 6a—b) 














Bit 31 30—25 24—21 20—16 15—11 10—5 4—0 
Field as On ee es Se a 
E Field: Specifies the size (16/32) of the ALU operation. 
1D Field: Specifies the destination register. 
rS1,rS2 Fields: Specifies the source register. 
N Field: Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 
F Field: Specifies the ALU function to be performed (|). 
C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 


115 = (long) r15 | 0x55 
























Bit 31 30—25 24—21 15—0 
Encoding [7 | 001107 | 1070 [07117] 0000000001010101 
Meaning [_(long) | l 0x0055 








RETURN 


INSTRUCTION: return (rM) 
CLASS: CA-Control 

CAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction moves the value of rM into the program counter (pc). This is the special case of the unconditional 
goto instruction where only the CAU registers can be selected. Register rM holds the return address of a 
subroutine. This instruction is encoded and executes as if (true)goto rM. This instruction disables interrupts 
for one instruction cycle. 


Instruction Replacement Table 
Replacement 
One of 22 CAU registers. 










LATENCY 


When the above instruction is executed, the instruction immediately following is also executed before the branch 
occurs. For details on latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Formats 0b and 1b) 


Bit 31-27 26—21 20—16 
Field ( Se as a aS) 


C Field: Specifies the condition (true). 

rB Field: Specifies the base address register (rM). 

N Field: Specifies a 16-bit 2's complement integer used as an offset value for the 
base address (zero). 


EXAMPLE (For field encodings refer to Chapter 10.) 


return (r18) 


Bit 31-27 26—21 20—16 15—0 
Encoding 70000 | 000001 | 10100 [0000000000000000 
Meaning [== ee] ie 0x0000 





ROTATE LEFT 


INSTRUCTION: _ [if (COND)] rD = {[(long)],(short)} rS <<<1 
CLASS: CA-Arithmetic/Logic 

CAU FLAGS AFFECTED: nzve 

DESCRIPTION 


This instruction performs rotate left through carry, i.e., shifts the contents of register rS to the left by one bit and 
shifts the carry from the previous instruction into bit 0. The result is placed in the register rD. The items in the 
brackets [] are optional. A 32-bit operation, (long), is the default size and does not have to be specified. (short) 
indicates that a 16-bit operation is desired. For the (short) operation, the result is sign extended from bit 15 to bit 
31. Flags are set according to the 16- or 32-bit operation flag rules (see Table 4-1), as specified by the 
instruction. If the instruction is conditionally executed and the COND is true, the result is placed in the register 
rD, and the flags are set. If the instruction is conditionally executed and the COND is false, neither the register rD 
nor the flags are affected. The COND field is the same as for the add operations (see page 4-21 ADD). 


Instruction Replacement Table 
Replacement 







| m | rO—r22 _| rO (hard-wired zero), or one of 22 CAU registers 
[1s | 0-122 _—_—| rO (hard-wired zero), or one of 22 CAU registers 
NSTRUCTION FORMAT (Format 6a—b) 
Bit 31 30—25 24—21 20—16 15—11 10—5 4—0 
Field CRASS Se ae De a a OE 
E Field: Specifies the size (16/32) of the ALU operation. 
1D Field: Specifies the destination register. 
rS Field: Specifies the source register. 
F Field: Specifies the ALU function to be performed (<<<). 
C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 


if(cs) r5 = 16 <<<1 
Bit 

Encoding 
Meaning 








ROTATE RIGHT 


INSTRUCTION: [if (COND)] rD = {[(long)], (short)} rS >>>1 
CLASS: CA-Arithmetic/Logic 

CAU FLAGS AFFECTED: nzc, v=0 

DESCRIPTION 


This instruction performs rotate right through carry, i.e., shifts the contents of register rS to the right by one bit and 
shifts the carry from the previous instruction into the most significant bit (MSB). The result is placed in the register 
rD. The items in the brackets [ ] are optional. A 32-bit operation, (long), is the default size and does not have to 
be specified. (short) indicates that a 16-bit operation is desired. For the (short) operation, the carry is shifted into 
bit 15, and the result is sign extended from bit 15 to bit 31. Flags are set according to the 16- or 32-bit operation 
flag rules (see Table 4-1), as specified by the instruction. If the instruction is conditionally executed and the 
COND is true, the result is placed in the register rD, and the flags are set. If the instruction is conditionally 
executed and the COND is false, neither the register rD nor the flags are affected. The COND field is the same 
as for the add operations (see page 4-21 ADD). 


[Term] Replacement [Meaning 
[Ss [12210 (hard-wired zero), or one of 22 CAU registers 


NSTRUCTION FORMAT (Format 6a—b) 















Bit 31 30—25 24—21 20—16 15—11 10—5 4—0 
Field Bers Se ae a Re es SS aa El 
E Field: Specifies the size (16/32) of the ALU operation. 
1D Field: Specifies the destination register. 
'S Field: Specifies the source register. 
F Field: Specifies the ALU function to be performed (>>>). 
C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 


if(cs) r5 = 16 >>>1 


Bit 31 30—25 24—21 | 20-16 | 15-11 10—5 4—0 
Encoding [__1 | 007700 | 700% [00101 [00110 [001001] 00110] 
Meaning [long [OT > TS Ts CT cs 





ROUND 


INSTRUCTION: —[Z =] aN = round(Y) 
CLASS: DA-Special Function 

DAU FLAGS AFFECTED: NZVU 
DESCRIPTION 


This instruction rounds the floating-point operand Y to 32-bit floating-point format. It is intended to operate on the 
contents of an accumulator that contains eight extra bits of precision in the mantissa (although other operands are 
allowed). The 40-bit operand is rounded to the nearest 32-bit representation and the result is stored in an 
accumulator (aN). The eight guard bits in the accumulator aN are set to zero. If the 40-bit operand is exactly 
between two 32-bit values, it is rounded to the greater value. The Y operand can specify an accumulator or a 
32-bit memory location. If a Z field is specified, the result is also stored to a word address in memory. 


Instruction Replacement Table 


g 
[aN [ad0—a3_—_—_| One of the four DAU accumulators 
Y “rP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 
*rP--, *rP++rl, rP refers to r1i—14 and is used as a memory pointer. 
tl refers to r15—19 and is used as an increment 


register. 

a0—a3 One of the four DAU accumulators 

“rP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 

“rP--,*rP+4rl rP refers to ri—14 and is used as a memory pointer. 
rl refers to r15—19 and is used as an increment 
register. 





LATENCY 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 





Bit 31—27 26—23 22—21 20—14 US newb 6—0 
Field Loiiii [ G [ N_ [oo00000 
G Field: Specifies the special function instruction type (round). 





N Fiel Specifies the accumulator used for result. 

Y Field: Specifies adder nput via accumulator direct or register indirect addressing 
modes. 

Z Field: Specifies destination of result via register indirect addressing modes. 


EXAMPLE (For field encodings refer to Chapter 10.) 


*112++116 = a3 = round(*r1--) 


Bit 31-27 26—23 2221 20—14 137 6—0 
Encoding 07100 | 17 [0000000|0001110[1100010 
Meaning [___ | round [as rt] 





SEED 


INSTRUCTION: [Z =] aN = seed(Y) 

CLASS: DA-Special Function 

DAU FLAGS AFFECTED: NZU, V=0 

DESCRIPTION 

This instruction produces a seed (reciprocal of Y) with mantissa good to three bits of accuracy by inverting all bits 
of Y, except the sign bit. The floating-point operand Y can specify a float memory location or an accumulator. 


The result is stored in an accumulator (aN). If a Z field is specified, the result is also stored to a word address in 
memory. 


Instruction Replacement Table 


g 
[aN _[a0—a3__—_—_| One of the four DAU accumulators 


vi “rP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 

“rP--, *rP++rl, rP refers to ri—14 and is used as a memory pointer. 
rl refers to r15—19 and is used as an increment 
register. 

ad—a3 One of the four DAU accumulators. 

Z 


“cP, *rP++, 32-bit memory location. ++ is +4 and -- is —4. 


“rP--, *rP++rl rP refers to ri—14 and is used as a memory pointer. 





LATENCY 
For latency effects refer to Section 4.4.2. 


INSTRUCTION FORMAT (Format 5) 
Bit 31—27 26—23 22—21 13—7 


20—14 6-0 
Field [oiiit [ GN [o000000 


G Field: Specifies the special function instruction type (seed). 

N Field: Specifies the accumulator used for result. 

Y Field: Specifies adder input via accumulator direct or register indirect addressing modes. 
Z Field: Specifies destination of result via register indirect addressing modes. 





EXAMPLE (For field encodings refer to Chapter 10.) 


a3 = seed(*r9++) 


Bit 6—0 
Encoding 0000000/1001111 [0000111 
[No write] 


Meaning a3 











SET 


INSTRUCTION: rD = (short) N 
CLASS: CA-Data Move 

CAU FLAGS AFFECTED: nz, v=c=0 
DESCRIPTION 


This instruction moves the value N into the register rD. N is a 16-bit 2's complement number that is MSB- 
extended to 32 bits. Flags are set according to 16-bit flag rules (see Table 4-1). The operation actually 
performed is rD = r0 +N. 


rO—122 
[N | t6-bitnumber [2s complementinteger 


INSTRUCTION FORMAT (Format 5a) 


Bit 31—26 25—21 20—16 15—0 
Field Ti SS Sa 
1D Field: Specifies the destination register. 


rS Field: Specifies the source register (r0). 
N Field: Specifies a 16-bit unsigned integer that is sign-extended to 32-bits. 






EXAMPLE (For field encodings refer to Chapter 19.) 


12= (short) 0x0800 


Bit 31—26 25—21 20—16 15—0 
Encoding 000101 | 00010 | 00000 0000100000000000 
Meaning: 5 |ess ee eres tye 0x0800 
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SET24 


INSTRUCTION: _rD = (ushort24) M 
CLASS: CA-Data Move 

CAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction moves the value M into register rD. M is a 24-bit unsigned integer that is zero extended to 32-bits. 


Replacement 
Se aS 
[__M___| 24bitnumber_[unsignedinteger 


INSTRUCTION FORMAT (Format 8b) 


Bit 31—29 28—21 20—16 15—0 
Field Et aS ee Ss ee ee) 


rD Field: Specifies the destination register. 

NE Field: Specifies the most significant 8-bits of the 24-bit integer. NE concatonated 
with N forms the 24-bit unsigned integer (NE || N). 

N Field: Specifies the least significant 16-bits of the 24-bit unsigned integer. 






EXAMPLE (For field encodings refer to Chapter 10.) 

r2= (ushort24) 0x080000 2 

Bit 31—29 28—21 20—16 15—0 

Encoding [110 | 00001000 00010 0000000000000000 
Meaning [ [| _oxog_ [2 0x0000 
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SFTRST 


INSTRUCTION: _ sftrst 

CLASS: CA-Control 

CAU FLAGS AFFECTED: None 

DESCRIPTION 

This instruction performs a soft reset of the processor level. It is intended to be used only from the error level and 
it returns the processor to the base level. By resetting registers and branching to zero an emulation of reset can 
be accomplished. This instruction is encoded as spc=(byte)r0. 


INSTRUCTION FORMAT (Format 75) sftrst 


Bit 31 | 30—26 | 25| 24 | 23-21] 20-16 | 15—11 | 10 S=6i 
Field [1 [oo1ii{/ 0] 1 | 000 [00000] 000007 1 | 00000 
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SHIFT LEFT 


INSTRUCTION: rD = {{(long)], (short)} rD << N (1) 
[if(COND)] rD = {{(long)], (short)} rS1 << rS2 (2) 
[if(COND)] rD = {[(long)], (short)} rS << 1 (3) 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nz, v=c=0 
DESCRIPTION 


These instructions perform logical left shift operations. The items in brackets [] are optional. A 32-bit operation, 
(long), is the default size and does not have to be specified. A (long) left shift can be considered an arithmetic left 
shift, but note that overflow is not detected. (short) indicates that a 16-bit operation is desired. The (short) left 
shift is a logical left shift and the upper 16 bits of the result are zero. Flags are set according to the 16- or 32-bit 
operation flag rules (see Table 4-1), as specified by the instruction. If the instruction is conditionally executed and 
the COND is true, the result is placed in the register rD, and the flags are set. If the instruction is conditionally 
executed and the COND is false, neither the register rD nor the flags are affected. The COND field is the same 
as for the add operations (see page 4-21 ADD). 

(1) This instruction performs a logical left shift on the contents of register rD by N and the result is placed in the 
register rD. Only the least significant 5 bits of N are used to specify a shift of 0 to 31 binary places. 


(2) This instruction performs a logical left shift on the contents of register rS1 by the contents of register rS2, and 
the result is placed in the register rD. Only the least significant 5 bits of rS2 are used to specify a shift of 0 to 31 
binary places. 


(3) This instruction performs a conditional logical left shift by one of the register rS and the result is placed in the 
register rD. . 












Instruction Replacement Table 
[Term | Replacement [Meaning 
rO—122 
[ _N | __16-bitnumber _| 2's complement integer 


INSTRUCTION FORMAT (1) (Format 6-—d) 


Bit 31 30—25 24-21 | 20-16 15—0 
ee eee CS EE a a (ee 


INSTRUCTION FORMAT (2-3) (Format 6a—b) 









Bit 31 30—25 24—21 20—16 15—11 10—5 4—0 
Field BS RE Se ee Re ee 
E Field: Specifies the size (16/32) of the ALU operation. 
rD Field: Specifies the destination register. 
rS1, rS2 Fields: Specifies the source register. 
N Field: Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 
F Field: Specifies the ALU function to be performed (<<). 
C Field: Specifies the condition being tested. 
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SHIFT LEFT 


INSTRUCTION (cont.): — rD = {[(long)], (short)} rD << N 
[if(COND)]} rD = {[(long)], (Short)} rS1 << rS2 
[if(COND)] rD = {[(long)], (short)} rS << 1 


EXAMPLE (For field encodings refer to Chapter 10.) 
iff(ane) r3 = (short) r2 << 1 
Bit 31 30—25 24-21 20—16 15—11 10—5 4—0 


Encoding [ 0 | 001100 | 1100 [00011] 00010 /010100/ 10111 
Meat ict short) nomen Pa [toc ae a fea mer Pear mene oes ( 1) 
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SHIFT-OR 


INSTRUCTION:  rD =[(long)] rS <<|N 
CLASS: CA-Arithmetic Logic 


CAU FLAGS AFFECTED: nz, v=c=! 


DESCRIPTION 


This instruction is a shift and OR instruction. The value N is logically shifted to the left by 16 binary places and 
OR’ed with the contents of rS and the result is stored in the register rD. The instruction is always executed as 32- 
bits, (long). Specification of (long) is optional. Flags are set according to 32-bit flag rules. 






Instruction Replacement Table 
Term] Replacement] Meaning, 


r0—r22 10 (hard-wired zero), or one of 22 CAU registers 












rs r0—r22 10 (hard-wired zero), or one of 22 CAU registers 


[ N [16-bit number 2's complement integer 


INSTRUCTION FORMAT (Format 4b) 


Bit 
Field 


31—26 25—21 20—16 
Di Sa aS 
1D Field: Specifies the destination register. 


tS Field: Specifies the source register. 
N Field: Specifies a 16-bit 2's complement integer. 


EXAMPLE (For field encodings refer to Chapter 10.) 


12= (long) r1 <<| 0x0800 


Bit 


31—26 25—21 20—16 15—0 


Encoding 700100 | 00010 | 00001 0000100000000000 
Ll) al enn ey Sey aa 0x0800 
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SHIFT RIGHT 


INSTRUCTION: —_rD = {[(long)], (short)} rD >> N (1) 
[if(COND)] rD = {[(long)], (short)} rS1>>rS2_ (2) 
[if(COND)} rD = {[(long)}, (short)} rS >> 1 (3) 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nz, v=c=0 
DESCRIPTION 


These instructions perform logical right shift operations. The items in brackets [] are optional. A 32-bit operation, 
(long), is the default size and does not have to be specified. (short) indicates that a 16-bit operation is desired. 
For the (short) logical right shift, the upper 16 bits of the result are zero. Flags are set according to the 16- or 32- 
bit operation flag rules (see Table 4-1), as specified by the instruction. If the instruction is conditionally executed 
and the COND is true, the result is placed in the register rD, and the flags are set. If the instruction is 
conditionally executed and the COND is false, neither the register rD nor the flags are affected. The COND field 
is the same as for the add operations (see page 4-21 ADD). 


(1) This instruction performs a logical right shift on the contents of register rD by N and the result is placed in the 
register rD. Only the least significant 5 bits of N are used to specify a shift of 0 to 31 binary places. 


(2) This instruction performs a logical right shift on the contents of register rS1 by the contents of register rS2, and 
the result is placed in the register rD. Only the least significant 5 bits of rS2 are used to specify a shift of 0 to 31 
binary places. 


(3) This instruction performs a conditional logical right shift by one of the register rS and the result is placed in the 
register rD. . 
















instruction Replacement Table 


[Term | Replacement Meaning 


Term 
eee TO (hard-wired zero), or one of 22 CAU registers 
rS, rS1, rS2 TO (hard-wired zero), or one of 22 CAU registers 


16-bit number 2's complement integer 


INSTRUCTION FORMAT (1) (Format 6c—d) 


Bit 31 30—25 24—21 | 20-16 15—0 
Go | SSS SUPE n Rae ee an a 


INSTRUCTION FORMAT (2-3) (Format 6a—b) 










Bit 31 30—25 24—21 20—16 15—11 10—5 4—0 
Field Se hii Se a Se ea eae) 

E Field: Specifies the size (16/32) of the ALU operation. 

1D Field: Specifies the destination register. 

1S1, rS2 Fields: Specifies the source register. 

N Field: Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 

F Field: Specifies the ALU function to be performed (>>). 

C Field: Specifies the condition being tested. 
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SHIFT RIGHT 


INSTRUCTION (cont.): —_ rD = {[(long)], (short)} rD >> N 
[if(COND)] rD = {[(long)], (short)} rS1 >> rS2 
[if(COND)] rD = {[(long)}, (Short)} rS >> 1 


EXAMPLE (For field encodings refer to Chapter 10.) 


12 = (long) r2 >> 30 


Bit 31 30—25 24—21 | 20-16 15—0 
Encoding [_1 | 007107 | 1100 | 00010[ 0000000000011110 
Cy) A es LS a | eT) 
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SHIFT RIGHT-ARITH 


INSTRUCTION: _rD = {[(long)], (Short)} rD $>>N 
[if(COND)] rD = {{(long)], (short)} rS1 $>> rS2 
{if(COND)] rD = {[(long)], (short)} rS /2 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nz, v=c=0 
DESCRIPTION 


These instructions perform arithmetic right shift operations, The items in brackets [] are optional. A 32-bit 
operation, (long), is the default size and does not have to be specified. (short) indicates that a 16-bit operation is 
desired. For the (short) arithmetic right shift, the result is sign extended from bit 15 to bit 31. Flags are set 
according to the 16- or 32-bit operation flag rules (see Table 4-1), as specified by the instruction. If the instruction 
is conditionally executed and the COND is true, the result is placed in the register rD, and the flags are set. If the 
instruction is conditionally executed and the COND is false, neither the register rD nor the flags are affected. The 
COND field is the same as for the add operations (see page 4-21 ADD). 


(1) This instruction performs an arithmetic right shift on the contents of register rD by N and the result is placed in 
the register rD. Only the least significant 5 bits of N are used to specify a shift of 0 to 31 binary places. 


(2) This instruction performs an arithmetic right shift on the contents of register rS1 by the contents of register rS2, 
and the result is placed in the register rD. Only the least significant 5 bits of rS2 are used to specify a shift of 0 to 
31 binary places. 


(3) This instruction performs a divide-by-two operation; i.e., the arithmetic right shift with sign extension. The 
contents of register rS are divided by two, and the result is placed in register rD. 


Term | Replacement [Meaning 
D 
1S, $1, 182 


INSTRUCTION FORMAT (1) (Format 6c—d) 
Bit 31 30—25 24—21 20—16 15—0 


fo | san Onn el ae Ea al a 
INSTRUCTION FORMAT (2-3) (Format 6a—b) 














Bit 31 30—25 24—21 20—16 15—11 10—5 4—0 
Field eS Sn a Ss ae a a 
E Field: Specifies the size (16/32) of the ALU operation. 
rD Field: Specifies the destination register. 
1S1, rS2 Fields: Specifies the source register. 
N Field: Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 
F Field: Specifies the ALU function to be performed ($>>). 
C Field: Specifies the condition being tested. 
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SHIFT RIGHT-ARITH 


INSTRUCTION (cont.): —_rD = {[(long)], (short)} rD $>> N 
[if(COND)] rD = {[(long)], (short)} rS1 $>> rS2 
[if(COND)] rD = {[(long)], (short)} rS /2 


EXAMPLE (For field encodings refer to Chapter 10.) 


4 = (long) r2 $>> 13 
Bit 31 30—25 24—21 20—16 15—11 10—5 


4-0 
Encoding [_ 7 | 007100 | 1101 | 00100 | 00010 | 000001| 01101] 
Meaning, \[ittongya sr | Ss | era 2 ten [arise 
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STORE 


INSTRUCTION: MEM =({(byte), (hbyte), (short), [(long)]} rS_— (1) 
*L_ = {(byte), (hbyte), (short), [(long)}} rS_(2) 
iorD = {(byte), (hbyte), (short), [(long)} rS_— (3) 


CLASS: CA-Data Move 
CAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction moves data from the register rS to a memory location or 10 register specified on the left-hand 
side. The default size of the transfer is (long) and does not have to be specified. The CAU register bits selected 
and the size of the transfer are specified by the key words: (byte), (hbyte), (short), and (long). (byte) and (hbyte) 
are 8-bit transfers that select register bits 7—0 and 15—8 respectively. (short) is a 16-bit transfer that selects 
register bits 15—O. (long) is a 32-bit transfer that selects registers 31—0 (see below). 


(1) This instruction moves data from the register rS to a memory location addressed using the register indirect 
addressing mode. Data can be written as 8-, 16-, or 32-bits. Post-incrementation and post-decrementation is a 
function of the data size (8-bit, +1; 16-bit, +3; 32-bit, +4). 


(2) This instruction moves data from the register rS to a memory location addressed using the direct addressing 
mode (*L). Lis a 16-bit unsigned integer that is concatenated with the upper 16-bits representing the location of 
‘on-chip memory resources (0x0000 or 0x5003, based on pcw[10}) to form the address. 


(3) This instruction moves data from the register rS to an IO register. The transfer size and the size of the 
selected IO register must be the same. 






Source, CAU Register Bits Selected 
[si—24 | 23-16 
aa ess 
| ees | eee XE ESIDATA 
[x | x | DATA | DATA [Store bits 15-0 | 
DATA DATA DATA DATA 
























Instruction Replacement Table 


Replacement 


r0—r22, pc, pesh | r0 (hard-wired zero), one of 22 CAU registers, the program 
counter, or the program counter shadow register 


“rP, *rP++, ‘fP--, | 8-, 16-, or 32-bit memory location. 
“rP+4rl ++ is +1, +2, or +4 for data transfer sizes of 8-, 16-, and 32-bits. 
-- ig -1, -2, and —4 for data transfer sizes of 8-, 16-, and 32-bits. 
rP is a memory pointer (rP = rO—r22); when using the sp (r21), 
the size of the transfer must be long. 
rlis an increment register (rl = rO—r22). 












[_ior0—ior15 _| One of the IO registers 
16-bit number 


Unsigned integer - used as a direct address 





RESTRICTION 


AAU or IO register store instruction cannot immediately follow a DA instruction (multiply/accumulate or special 
function) that references memory in the Y field (see Section 4.4.1.2, Restriction 2— CAU and IO Register Store). 
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INSTRUCTION (cont.): MEM = {(byte), (hbyte), (short), [(long)]} rS 
*L_ = {(byte), (hbyte), (short), [(long)]} rS 
iorD = {(byte), (hbyte), (short), [(long)]} rS 


INSTRUCTION FORMAT (1) (Format 7c) 


Bit 31| 30-26 | 25| 24 | 23~21| 20-16 | 15-11 | 10] 95 
Field Lif ooiiifofT rt] wif{onm [ we Tofooooo nies 


INSTRUCTION FORMAT (2) (Format 7a) 


Bit 31| 30-26 | 25| 24 | 23~21| 20-16 15—0 
Field”... [50 peste a]+00) sate] a We 1] cS rH to jue a Sete bene re a 


INSTRUCTION FORMAT (3) i si 


Bit 31| 30-26 | 25 23—21 | 20-16 | 15-11 | 10] 9-5 
Field [i] 00111 as ooooo|t1{oo000 a 


T Field: Specifies the direction of the transfer (to). 

WField: — Specifies the size of the transfer. 

tH Field: Specifies the source/destination register (source). 

tP Field: Specifies the memory pointer register. 

rlField: Specifies the increment register. 

iorH Field: Specifies the source/destination IO register (destination). 
L Field: Specifies a 16-bit unsigned integer. 


EXAMPLE (For field encodings refer to Chapter 10.) 


“r++ = (hbyte) 3 


Bit 31| 30-26 | 25| 24 | 23-21] 20-16 | 15—11 | 10 
Encoding | 1 [| 00111] 07 1 [ 100 [00011] 00001 | 0 | saten 
Meaning [TT trom] (hoyte) [srt 


a 


STORE 





STORE-IOR 


INSTRUCTION: MEM = {(byte), (short), [(long)]} iorS 

CLASS: CA-Data Move 

CAU FLAGS AFFECTED: None 

DESCRIPTION 

This instruction moves data from an IO register to a memory location. The default size of the transfer is (long) 
and does not have to be specified. The size of the transfer and the IO register bits selected by the transfer are 
specified by the key words: (byte), (short), and (long). (byte) is an 8-bit transfer that selects bits 7—0. (short) isa 


16-bit transfer that selects bits 15—0 . (long) is a 32-bit transfer that selects bits 31—0 (see below). Post- 
incrementation and post-decrementation is a function of the data size (8-bit, 1; 16-bit, +2, ; 32-bit, +4). 













Source IO Register Bits Selected 
























| Type | Size | 31—24| [70 [Description 
roteytey | 8 | x | x | X | DATA |Selectbis7—o 


















Instruction Replacement Table 
8-, 16-, of 32-bit memory location. 
++ is +1, +2, or +4 for data transfer sizes of 8-, 16-, and 32-bits. 
-- ig -1, -2, and —4 for data transfer sizes of 8-, 16-, and 32-bits. 
rP is a memory pointer (rP = r0—r22); when using the sp (r21), 
the size of the transfer must be long. 
rlis an increment register (rl = rO—r22). 


One of the IO registers 


rm Replacement 


B “PS SrPet ter 
L_ iors _| 


*rP+srl 
RESTRICTION 









ACAU or IO register store instruction cannot immediately follow a DA instruction (multiply/accumulate or special 
function) that references memory in the Y field (see Section 4.4.1.2, Restriction 2— CAU and IO Register Store). 


INSTRUCTION FORMAT (Format 7d) 
Bit 31] 30-26 | 25| 24 | 23~-21| 20-16 | 15-11 |10|_ 9—5 4-0 
ea A a AC 


T Field: Specifies the direction of the transfer (from a register). 
WField: Specifies the size of the transfer. 

rP Field: Specifies the memory pointer register. 

tl Field: Specifies the increment register. 

iorH Field: Specifies the source/destination IO register (source). 


EXAMPLE (For field encodings refer to Chapter 10.) 


*r1 = (short) ior 



















Bit 30—26 20-16 | 15—11 | 10] 9-5 4-0 
Encoding OoO111| 1 00000] 00001] 0 | 00000| 00000 
Meaning [_[ TT from] i 2 Lae 
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SUBTRACT 


INSTRUCTION: 1D ={{(long)], (short)} rD - N (1) 
rD = {[(long)], (short)} N - rD (2) 
[if(COND)] rD = {[(long)], (short)} rS1 - rS2 (3) 
[i(COND)] rD = {{(long)]}, (short)} - rS (4) 


CLASS: CA-Arithmetic/Logic 
CAU FLAGS AFFECTED: nzve 
DESCRIPTION 


These instructions perform subtract operations. The items in brackets [] are optional. A 32-bit operation, (long), 
is the default size and does not have to be specified. (short) indicates that a 16-bit operation is desired. A (short) 
operation sign extends the result from bit 15 to bit 31. Flags are set according to the 16- or 32-bit operation flag 
tules (see Table 4-1), as specified by the instruction. If the instruction is conditionally executed and the COND is 
true, the result is placed in the register rD, and the flags are set. If the instruction is conditionally executed and 
the COND is false, neither the register rD nor the flags are affected. The COND field is the same as for the add 
operations (see page 4-32). 

(1) is a right subtract instruction. This instruction subtracts the value N from the contents of register rD and 
places the result in the register rD. N is a 16-bit 2's complement number that is MSB-extended to 32 bits. 


(2) is a left subtract instruction. This instruction subtracts the contents of register rD from a value N and places 
the result in the register rD. N is a 2's complement number that is MSB-extended to 32 bits. 


(3) is a conditional triadic subtract instruction. This instruction subtracts the contents of register rS2 from register 
rS1 and places the result in rD. 


(4) conditionally changes the sign of the contents of register rS. The result is placed in register rD. This negate 
instruction sets overflow flag v to one (v = 1) for rS = Oxffff8000 and sets carry flag c to zero 
(c = 0) for rS = 0x00000000; in all other cases v=0,c=1. 


[122 | r0 (hard-wired zero), or one of 22CAU registers | 
[ N | t6bitrumber [2s complementinteger 

















SUBTRACT 


INSTRUCTION (cont.): —_rD = {[(long)], (short)} rD - N 
rD = {[(long)]}, (short)} N - rD 
[if(COND)] rD = {[(long)], (short)} rS1 - rS2 
[if(COND)] rD = {{(long)], (short)} - rS 


INSTRUCTION FORMAT (1-2) (Format 6c—d) 


Bit 31 30—25 24-21 | 20—16 15—0 
Field EE BU aes a ae a 


INSTRUCTION FORMAT (3-4) (Format 6a—b) 





Bit 31 30—25 24—21 20—16 15—11 10- 

Field ee ge ee ea 
E Field: Specifies the size (16/32) of the ALU operation. 
rD Field: Specifies the destination register. 
1S1, rS2 Fields: Specifies the source register. 
N Field: Specifies a 16-bit 2's complement integer that is MSB extended to 32-bits. 
F Field: Specifies the ALU function to be performed (-). 
C Field: Specifies the condition being tested. 


EXAMPLE (For field encodings refer to Chapter 10.) 





if(mi) 10 = -r5 

Bit 31 30-25 24—21 | 20-16 | 15—11 10—5 4-0 
Encoding aa 007700 | 0100 [°01010 | 00000 | 000011) 00101 
Meaning Ss) ee a ee 
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WAITI 


INSTRUCTION: wait 
CLASS: CA-Control 

CAU FLAGS AFFECTED: None 
DESCRIPTION 


This instruction initiates the wait-for-interrupt mode. The wait-for-interrupt mode is a power-saving mode that 
stops operations in the CAU and DAU until an interrupt is recognized. If an interrupt is pending when the waifi 
instruction is executed, the waiti instruction is ignored. A nop instruction should always follow the waiti instruction 
because when the interrupt is recognized this instruction is executed before the interrupt is taken. It should only 
be executed from the base level. This instruction is encoded as spc=(long)r0. 


INSTRUCTION FORMAT (Format 7b) waiti 


Bit 31 S0— 28 25| 24 | 23-21| 20-16 | 15—11 | 10 
Field fooviifof 1 [117 Jooooof ooooo] 1 | Too [071010 | 
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5. SIGNAL DESCRIPTIONS 


This section contains a brief description of the signals on the DSP3210. Figure 5-1 illustrates the functional 
organization of these pins which are divided into six categories: memory interface, serial /O, bit /O, utility, 
interrupt, and power supply. Table 5-1 is a summary of the signal pins provided in the 132 POFP package. 


Table 5-1 Pin Summary 


Bus Interface DO—D31, A2—A31, MSO—MS3 66 
DRAM Signals CSN, RW, PBD/BLMN 3 
SRAM Signals ASN, MRN, MWN at 
Control SRDYN, BERRN, AEN, DEN, DLE 5 
Arbitration Signals BRN, BGN, BGACKN, LOCKN 4 

Serial /O DI, ICK, ILD, DO, OCK, OLD, SY 7 

Bit VO BIOO—BIO7 8 

Utility CKI, RESTN, ZN 3 

Interrupt IRON, IR1N, IACKO, IACK1 4 

Vss 20 

Vop ahh 

Total 132 


* PBD(BLMN) - Page Break Detect and Block Move share the same pin, selection is made via pew[14—15] 
+ AEN is multiplexed with MRN and DEN is multiplexed with MWN, selection is made via pew|12] 


ee < DATA» Do—D31 
uTILiTy ( RESTN - 


ZN 
ADDRESS > A2—A31 
IRON—IR1N lit 
INTERRUPT <<“ ACKO—IACKA s MEMORY 
SN INTERFACE 
RW 
BIT vO — 
(e) BIO0—BIO7 <—>| mpbielini 
DI AEN/MRN 
ICK DEN/MWN 
ILD DLE 
SERIAL VO DO SRDYN 
EK BERRN 
OLD 
sh LOCKN 
EBN BUS 
i Vop BGN ARBITRATION 
ROWER Se Vas BGACKN 


Figure 5-1 DSP3210 Signal Groups 
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A description of the function of each pin is presented in Table 5-2. 
Table 5-2 Pin Descriptions 


[ PinName | Type’ | __Name__| Brief Description 


DO—D31 VO(3) | External Memory | Bidirectional signals provide the general purpose data path 
Data Bus between the DSP3210 and system resources. The data bus 
transmits and receives 8-, 16-, and 32-bit data. 
Address Bus 


ie = Provides memory select information. Signaling based on pew(9], 
M/IN: M/IN=1: Memory selects convey data type and word offset. 
M/IN=0: Memory selects convey byte select information. 
| 0(3) [Cycle Start____| Indicates the beginning of a valid external memory transaction. _| 
| ASN___|' 0/3) _| Address Strobe __| Indicates that address and R/W information are valid. _ | 
RW O(3) | Read/Write Defines the direction of a data transfer. Signaling based on 

Pew[11], R/WN: R/WN=1: A high level indicates a read from an 
external device, a low level indicates a write to an external device. 
R/WN=0: A low level indicates a read from an external device, a 
high level indicates a write to an external device. 
AEN and MRN share one pin and the function is controlled by 

+ The functions of AEN and MAN are multiplexed on one pin, selection is made via pow( 12) 


PBD/BLMN | O(3) | Page Break Provides Page Break Detect or Block Move indication based on 
Detect/Block pew[15—14], P/BN. P/BN=01, 1X: Indicates that the current 
Move memory access is not on the same page/not sequential as the 
previous memory access. P/BN=0: Indicates the first word of 
pew[12], D/SN. When D/SN=0, MRN is selected. Indicates that 
+ The functions of DEN and MWN are multiplexed on one pin, selection is made via pew(12]. 



















Address Enable 



















quad-word transfer in a doblock sequence. 
AEN I 
MRN (3) | Read Strobe 
data should be placed on the data bus during read transactions. 


AEN and MRN share one pin and the function is controlled by 
DEN: f Data Enable DEN and MWN share one pin and the function is controlled by 
pew[12], D/SN. D/SN=1: DEN is selected. When asserted, DEN 
enables the data bus on writes. When negated, DEN tristates 
same. 
MWN O(3) | Write Strobe DEN and MWN share one pin and the function is controlled by 
pew[12], D/SN. When D/SN=0, MWN is selected. Indicates that 










Ppew[12], D/SN. When D/SN=1, AEN is selected. When asserted, 
AEN enables the address bus and memory selects, and when 
valid data is on the data bus during write transactions. 

| DLE [1 | DataLatch Enable| Enables data on the data bus into the input data latch on reads. 


negated, AEN tristates same. 
* |= Input, O = Output, P = Power, NC = No connection, OD = Open Drain, and (3) = 3-state. 
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Table 5-2 Pin Descriptions (continued) 


SRDYN 







Indicates that a data transfer is complete. When asserted, the 

Ready DSP3210 completes the current transaction. This is a synchronous 
input and setup and hold times must be met for reliable operation. 
SRDYN is used in conjunction with BERRN and BGN to control 
telinquish, retry, and bus error operations. 
Indicates that a data transfer has encountered a bus error. Itis 
used in conjunction with SRDYN and BGN to control relinquish, 


ar 
retry, and bus error operations. 
Asserted when the DSP3210 is requesting interlocked memory 
transactions. 


O(3) | Bus Request Asserted when the DSP3210 is requesting mastership of the bus. 
Itindicates that access to external memory is needed. 
BGN [1 |[BusGrant__| Grant for ownership of the bus. 


| _BGN__| 
BGACKN O(3) | Bus Grant Indicates that the DSP3210 has taken ownership of the data and 
Acknowledge address busses and control signals. When negated, it indicates 
BIOO-BIO7 | /O(3) | Bit VO 


that the DSP has placed data, address and control signals in the 
} or _{ 14 Serial Data In 


















Synchronous 














































Eight bidirectional signals that can be used for status and control 
information. Configured by bioc register, and data is transferred 
via bio register. 
Serial data input. 
Clock for serial data. In internal mode, ICK is generated on-chip 
(as specified in ioc) and is an output; in external mode, ICK is an 
input. 
“TSignal for loading input buffer from internal shift register. In internal 
mode, ILD is generated on-chip (as specified in loc) and is an 
output; in external mode, ILD is an input. 
Serial data output. 
Clock for serial data. In internal mode, OCK is generated on-chip 
(as specified in loc) and is an output; in external mode, OCK is an 
it 






































ILD 
‘Signal for loading input buffer from internal shift register. In internal 
mode,OLD is generated on-chip (as specified in loc) and is an 


| 00] 

= output; in external mode, OLD is an input. 
SY VO(3) | Synchronization Provides frame synchronization to or from the serial port. 
Cki ci In System clock. 


1 — -1— 
RESTN Reset Highest priority exception. When asserted, the processor enters 
3-state 


into the reset state. A low-to-high transition initiates the reset 
sequence, Call r22+0 (r20); nop. 
When asserted, all DSP3210 output pins are 3-stated. ZN has an 
on -chip pull- “Up, therefore, it is not necessary that it be connected. 
IACKO O(3) ia 
Acknowledge 0 
TRIN |__| __| Interrupt Request 1 
IACK1 O(3) | Interrupt 


Acknowledge 1 


| —voo [Ff Power —_] 
P 


Ground 







































Indicates the servicing of the external interrupt request 1. 





* | =Input, O = Output, P = Power, NC = No connection, OD = Open Drain, and (3) = 3-state. 
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Table 5-3 is a summary of the characteristics of the signals. 


Input/Output | Active State 
| ZN | 


Table 5-3 Signal Summary 


Signal Function Signal 
Name 

































2 
a 


Serial Input Clock ICK 





[DataBus_ SSS] =~ D0—D31__| InputvOutput_| High/Low 
[AddressBus_—s«| == A2—AST [Output High/Low _| 
MSO—MS3 Output High/Low | Yes 
MSO—MS32 Output Low Yes 
[Cycle Stat +| CSN ‘| Output | Low __| Yes | 
[Address Strobe (| ASN [Output | Low _| Yes | 
[ReadWrite S| ~—SsRW Output High/Low _| 
foals | nae | up| tow __| 
Read Strobe MRN4 
Data Enable DEN p= se | Low 
Write Strobe MWNS Low Yes 
[DataLatchEnable | DLE [| ——sinput__— [High |, — 
[Synchronous Ready | SRDYN | Input_ [Low | — 
[BusEror | BERN | Input | tow | — J 
[Bustock | LOCKN | Output |" tow __| Yes | 
[Bus Request| BRN | Output [Low __| Yes | 
(Bus Grant) =o oo ef SBGNees) = sinpubes [tow | =) 
a Se Te 
BIOO—BIO7 p tp High/Low 
[input | High/Low | — | 
Hiparosbs 


<|< 





Serial Data In we p 



















a 
Ea 
— 
a 
Ea 
a) 
Es 
(es 
es 
Ee 
=) 
| Yes | — | 
Et 
a 
faa} 
a 
za 
a 
= 
es 
Bea 
Ea 
=a 


a 

LI 

cam 

meister aa 

ILD p p High-to-Low | Yes = 
[SerialDataOut «| ~~ ~DO | Output_| High/Low _| jee] 
Feel Cur tase —| — xd —|-Potoubr rare wa 
OLD p p High-to-Low Ea 
Ss 
Cea Cas (Sa ea Se (nel 
[Reset S| ~CRESTN | InpuvOutput | Low [| — | ea 
Ei =e eri ea eas id Sel RS aa 
Interrupt Request 0 Input -—tie | — | 
TACKO Output High Yes [ — | 
[interruptRequesti | __IRIN | __—input_— [Low | — | Esa 
aa 
a = 

SS I 


Powers 2t Swe Sul Voo 
V: Input 


The function of these pins in based on pew(9]. This entry is for pew[9] = 0. 
The function of these pins in based on pew(9]. This entry is for pew{9] = 1. 
The function of this pin in based on pew(12]. This entry is for pcw(12] = 1. 
The function of this pin in based on pew\12]. This entry is for pew(12] = 0. 

"The function of this pin in based on pew(12]. This entry is for pew{12] = 1. 
The function of this pin in based on pew[12]. This entry is for pow{12] = 0. 
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6. BUS INTERFACE 


This section provides a functional description of the bus, the signals that control it, and the bus cycles provided for 
data transfer operations and bus arbitration. For exact timing specifications, refer to Section 12 - Electrical 
— Characteristics. 


The DSP3210 allows 8-bit, 16-bit, and 32-bit operands to be located in external memory that is typically 32-bits 
wide. Section 3.4 Memory Organization describes how data of various sizes may be addressed. The DSP3210 
has an address bus that specifies the word address for the transfer and a data bus that transfers the data. 
Control signals indicate the beginning of the cycle, the data size or bytes of a 32-bit word requested, and the type 
of transfer, read or write. Bus cycles are synchronous. The ready line, SRDYN, the bus error signal, BERRN, 
and the bus grant signal, BGN, must meet setup and hold times for reliable operation. 


6.1 Bus Interface Signal Descriptions 


DOo—D31 









Address Bus (Output; 3-state). The address bus conveys word addresses to the external 
system. The address space of the DSP3210 is partitioned between on-chip memory and 
external memory. External addresses initiate bus cycles and are accompanied by control 
strobes. Internal memory addresses are issued externally, but do not initiate bus cycles, unless 
1) an external bys cycle is in progress or 2) the DSP3210 is not granted its bus. Only addresses 
for external memory are accompanied by control strobes. 3-stated by ZN, AEN, and 3-stated 
when BGACKN is negated. 

Data Bus (Bidirectional; 3-state). The external data bus transmits data to and from the 
DSP3210. The data bus can transmit bytes (8-bit data), halfwords (16-bit data), and words 
(32-bit data). On write transactions, byte data is replicated on the three remaining bytes of the 
data bus while halfword data is replicated on the remaining halfword of the data bus. 3-stated 
by ZN and DEN. 
Memory Selects (Output; 3-state). The information transmitted on MSO—MSS is controlled by 
two bits in the PCW register, PCW(9—8]. PCW(9], M/IN, controls whether Motorola or Intel 
style signaling is used to convey the data type and word offset information. PCW/[8], B/LN, 
controls the byte ordering used when addressing internal or external memory. If M/IN = 1, 
Motorola semantics are selected and MSO—MS3 have the same meaning as AO-A1 and SIZO- 
SIZ1, where MSO is AO, MS1 is A1, MS2 is SIZO, and MS3 is SIZ1. If M/IN = 0 and B/LN = 0, 
Intel semantics are selected and MSO—MS3 have the same meaning as BEO#—BE3#, where 
MSO is BEO#, MS1 is BE1#, MS2 is BE2#, and MS3 is BE3#. Figure 6-1 shows the signal 
information for all data accesses supported by the DSP3210. Also shown is the signaling 
supplied if M/IN = 0 and B/LN=1. 3-stated by ZN, AEN and 3-stated when BGACKN is ne 
Cycle Start (Output; Active low; 3-state). When asserted (low), this signal indicates the 
beginning of an external memory transaction. CSN may be used in DRAM memory designs to 
generate an early RAS signal. 3-stated by ZN and 3-stated when BGACKN is negated. 
Address Strobe (Output; Active low; 3-state). When asserted (low), this signal indicates that 
the address (A2—31) and memory selects (MSO—MS3) are valid. When negated, this signal 
indicates that the memory transaction is complete. 3-stated by ZN and 3-stated when BGACKN 
is negated. 

Read/Write (Output; 3-state). The sense of the read/write signal is controlled by the R/WN bit 
in the PCW register, PCW[11]. If R/WN = 1, RW has the same semantics as R/W- ina 
Motorola 680x0. When high, a read transaction is indicated, and when low, a write transaction 
is indicated. If R/WN = 0, RW has the same semantics as W/R# in an Intel i386. When high, a 
write transaction is indicated, and when low, a read transaction is indicated. 3-stated by ZN and 
3-stated when BGACKN is negated. 


























MSO—MS3 


























gated. 
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Bus Interface Signal Descriptions (continued) 


[_Name_| 
LOCKN | Bus Lock (Output; Active Low; 3-state). When asserted (low), this signal indicates that the 












DSP3210 is requesting interlocked bus transactions. The LOCKN is asserted during the execution 
of a dolock instruction sequence. 3-stated by ZN and 3-stated when BGACKN is negated. 

Block Move (Output; Active Low; 3-state) BLMN and PBD share one pin. When 
pew[15—14]=00, BLMN is selected. When asserted [low], BLMN indicates that the DSP3210 is 
starting to perform a quad-word block move operation. The BLMN is asserted each time during the 
doblock instruction sequence in which the two least significant word address lines, A2A—A3, are 0 
and at least four words remain to be transferred. 3-stated by ZN and 3-stated when BGACKN is 
negated. 

Page Break Detect (Output; Active high; 3-state). BLMN and PBD share one pin. When 
pew[15—14] = 01, 10, or 11, PBD is selected. When asserted [high], this signal indicates that the 
current external memory access is not on the same page as the previous external memory access. 
The page size is specified in the pew register, pcw[4—6], and the default setting is 256 words. 
When a page size of 256 words is selected, sequential addressing break detection can also be 
enabled by pcw(7]. Used to achieve higher speed accesses to static column and page mode 
DRAMs or VRAMs. PBD is always asserted for the first transaction after the DSP3210 takes 
ownership of the bus. 3-stated by ZN and 3-stated when BGACKN is negated. 






















BLMN 






PBD 





















Data Type Memory Selects 
Data Lew MIN=1 & B/LN=X 
PES Cocs 
esa Va 

I Ee [4 [1 | 
Pot ai (Oa 
ES 


7 


Type 

eet) 
[e-it_| 
[ei _| 
[est | 





bit 





Gal 
es 
1{o[0 | 
Eu 


+ 4-word transfers are also signaled with the BLMN pin, 


joes ee) jue 
peed a 
OH tl 
ee 
oe a 
Mf Bal! oa 
Li] 0 | ea 
ete | 9 | 
eee Es 


ect 

SE Rk 
ete ale ral 
ES RES jai 
P2 Tif) | ° | 
G5 shat | 2) | 9 | 
ee [eo 





Write Data Key: 


A = logical write data D0O—7 with little endian byte ordering. 
logical write data D8B—15 with little endian byte ordering. 
logical write data D16—23 with little endian byte ordering. 
D = logical write data D24—31 with little endian byte ordering. 





Underline indicates logical position for write data for little endian byte ordering. 
Bold indicates logical position for write data for big endian byte ordering. 


Figure 6-1 External Memory Select Signaling and Write Data Duplication 
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Bus Interface Signal Descriptions (continued) 


Address Enable (Input; Active Low). AEN and MRN share one pin. When pcw(12] = 1, AEN is 
selected. When asserted (low), AEN enables the address bus, A2—31, and memory selects, 
MSO—MS3, and when negated, AEN tristates same (Note that ZN and BGACKN take 
precedence over AEN). AEN, along with DEN, can be used to multiplex the address bus and 
data bus. 

Read Strobe (Output; Active low; 3-state). AEN and MRN share one pin. When pew[12] = 0, 
MRN is selected. MRN is asserted (low) during read transactions to indicate that data may be 
placed on the data bus (D0O—D31). MRN is typically used in SRAM memory designs as an 
output enable. 3-stated by ZN and 3-stated when BGACKN is negated. 

Data Enable (Input; Active Low). DEN and MWN share one pin. When pcw([12] = 1, DEN is 
selected. When asserted (low), this input enables the data bus (D0—D31) during write 
transactions (Note that ZN takes precedence over DEN). DEN, along with AEN, can be used to 
multiplex the address bus and data bus. 

Write Strobe (Output; Active low; 3-state). DEN and MWN share one pin. When pew([12] = 0, 
MWN is selected. MWN is asserted (low) during write transactions and data setup and hold 
times are provided around its rising edge. MWN is typically used in SRAM memory designs as a 
write enable. 3-stated by ZN and 3-stated when BGACKN is negated. 

Synchronous Ready (input; Active Low). Indicates that a synchronous data transfer is 
complete. When asserted, the DSP3210 completes the current transaction. This is a 
synchronous input and setup and hold times with respect to CKI must be met for reliable 
operation. SRDYN is used in conjunction with BERRN and BGN to signal bus errors, retry, and 
relinquish-retry operations. 

Bus Error (Input; Active Low). Indicates that the data transfer is invalid. It is used in conjunction 
with SRDYN and BGN to indicate that a retry, relinquish/retry, or bus error exception should be 
taken. A bus error causes tHe processor to proceed to an exception handler. Refer to Section 
6.4 - Bus Error for the encoding and assertion results for SRDYN, BGN and BERRN. This is a 
synchronous input and setup and hold times with respect to CKI must be met for reliable 
operation. 

Data Latch Enable (Input; Active High). When asserted (high), this input enables data on the 
data bus (DO—D31) into the input latch. DLE is provided to simplify the design fora DSP3210 
communicating on a system bus operating asynchronously from the DSP3210. 
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6.2 Processor Contro! Word 


The Processor Control Word, pew, is a 16-bit internal register that can be read or written using selected CA Data 
Move Instructions, such as pew = r1. The pew register is used to configure the external memory wait-states in 
partitions A and B, the page size for generating the page break detect (PBD), the memory map, the byte ordering, 
and the signaling supplied for external memory interfacing. Figure 6-2 shows the bit encoding of the pew register. 
By writing a one in Bit 13 with a program, the value loaded into the pew register can be locked so that programs 
do not inadvertently change its value. The effect of writing this bit is cleared when the processor is reset or when 
an error exception is taken. pew[10—13] are loaded with the logic values applied to pins BIO[7:4], respectively. 
The remaining pew register bits are set or cleared by reset as shown. 


19 pote — Ae) S10 | 8 nr eas, 2\4 
per WE | 


Bit [45 


14 
Field P/BN 
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WA | These bits configure the number of wait-states that are generated for transactions in 
partition A. Set to {1,1} on reset. 
pew 1.0 00 0-or-more wait-states 
01 ~—-1-or-more wait-states 
10 2-or-more wait-states 
11__3-or-more wait-states 
These bits configure the number of wait-states that are generated for transactions in 
partition B. Set to {1,1} on reset. 
pew 3.2 00 ~—_0-or-more wait-states 
01 =‘ 1-or-more wait-states 
10 = 2-or-more wait-states 
11__3-or-more wait-states 
These bits configure the page size for asserting the page break detect (PBD) signal. 
A2—AQ are always masked, A17—A31 are always unmasked, A10—A16 masking is 
controlled by the PBD field when comparing the last address to external memory. Set 
to {0,0,0} on reset. 
Page Size DRAM size 
256 words 64k xn 
512 words 256k x n 
1K words 1Mxn 
2K words 4Mxn 
4K words 16M xn 
8K words 64M xn 
16K words 256M xn 
32K words iGxn 
This bit enables(1)/disables(0) sequential address break detection and can be used 
with a page size of 256 only. Set to one on reset. 
Configures the byte ordering. If 0, little endian byte ordering is selected. If 1, big 
endian byte ordering is selected. Set to one on reset. 
Configures the MSO—MS3 pins. If 0, Intel style signaling is selected. If 1, Motorola 
style signaling is selected. Set to one on reset. 
Selects between one of two memory maps. If 1, w.Computer mode is selected and on- 
chip memory is at location zero. If 0, u.Processor mode is selected and external 
memory is at location zero. Reset value based on the state of bio[7] during reset. 
Configures the RW pin. If 0, RW pin is active high for write transactions. If 1, RW pin is 
active high for read transactions. Reset value based on the state of bio[6] during reset. 
Configures the operation of pins AEN/MRN and DEN/MWN. If 0, SRAM interface 
signaling selected (MRN and MWN). If 1, DRAM system bus interface signaling 
selected (AEN and DEN), Reset value based on the state of bio[5] during reset. 
Reset value based on the state of bio[4] during reset. Controls Boot ROM routine. 
Also, when written with a one, modification of the pew is disabled. 
15—14] P/BN | Configures the PBD/BLMN signal. Set to {0,0} on reset. 
pew 15.14 00 BLMNis selected 
01  PBDis selected; addresses to partition A are monitored. 
10 PBDis selected; addresses to partition B are monitored. 
11__ PBD is selected; addresses to partitions A and B are monitored. 





Figure 6-2 PCW Register Encoding 
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6.3 Standard Bus Cycles 


The DSP3210 inherently supports synchronous bus cycles. Synchronous bus cycles are completed by 
handshaking using the SRDYN signal. The point in the transaction at which SRDYN is observed can be altered 
by configuring the pew register for zero-, one-, two-, or three-or-more wait-states. The number of wait-states 
inserted is the longer of the number programmed or the number added using handshaking. The operation of the 
bus is summarized in Figure 6-3. Sections 6.3.1 to 6.3.5 walk through the operation of the standard bus cycles 
available in the DSP3210. For the discussion, it assumes that bus error (BERRN) is not asserted and that pcw[9] 
is configured for SRAM signaling (MRN and MWN). 













STATES 
Zero programmed wait-states | 1 [dle State (Initial State 


| 2 | Two programmed wait-states S1_| First Programmed Wait-state 
3__| Three programmed wait-states Second Programmed Wait-state 
[iP teres emo tequest asseried __ ry request asserted S3__| Third Programmed Wait-state 
| B__| Bus Grant (BGN) asserted E__| End Cycle State 
[Synchronous Ready (SRDYN) asserted |B [Bus GrantState 
Error Exception Acknowledge Ret 
Logical AND,OR, and NOT, respectivel 


Figure 6-3 Bus Interface State Diagram 








































DSP3210 Information Manual 
Bus Interface 


6.3.1 Read Transaction (0-or-more waits) 


A read transaction configured with 0-or-more wait-states takes a minimum of two clock periods to complete. Each 
clock period consists of two phases, a0 and 01. Wait-states repeat State E (see Figure 6-4). 


State SO (90): 
State SO (91); 
State E (90): 
State E (91): 


The address (A2—A31), the memory selects (MSO—MS3), the read/write signal (RW), and 
the page break detect (PBD/BLMN) are issued. The cycle start strobe (CSN) is asserted. 
The address strobe (ASN), and the read strobe (MRN) are asserted. 

CSN is negated. 

During phase 1, the synchronous ready line (SRDYN) is observed to indicate the end of the 
transfer. If SRDYN is high, a wait-state is inserted, and if it is low, the transfer is completed. 
At the end of the transfer, the DSP3210 samples the data bus (D0—D31), and ASN and MRN 
are negated. 


6.3.2 Synchronous Write Transaction (0-or-more waits) 


Awrite transaction configured with 0-or-more wait-states takes a minimum of two clock periods to complete. Each 
clock period consists of two phases, 00 and 01. Wait-states repeat State E (see Figure 6-4). 


State SO (90): 
State SO_(91); 
State E_(¢0): 
State E (91); 


The address (A2—A31), the memory selects (MSO—MS3), the read/write signal (RW), and 
the page break detect (PBD/BLMN) are issued. The cycle start strobe (CSN) is asserted. 
The address strobe (ASN), and the write strobe (MWN) are asserted. 

The CSN is negated, and data is enabled onto the external data bus (DO0—D31). 

MWN is negated. During 01, the synchronous ready line (SRDYN) is observed to indicate the 
end of the transfer. If SRDYN is high, a wait-state is inserted, and if it is low, the transfer is 
completed. At the end of the transfer, the DSP3210 3-states the data bus (D0—D31), and 
ASN is negated. Note that MWN is negated at the beginning of State E (01) even if wait- 
states are inserted. ? 


6.3.3 Read Transaction (1 to 3-or-more waits) 


A read transaction configured with 1 to 3-or-more wait-states takes a minimum of three clock periods. States S2 
and $3 are conditionally needed depending on the number of waits selected. External wait-states repeat State E. 
(see Figures 6-5 to 6-7). 


The address (A2—31), the memory selects (MSO—MS3), the read/write signal (RW), and the 
page break detect (PBD/BLMN) are issued. The cycle start strobe (CSN) is asserted. 
The address strobe (ASN), and the read strobe (MRN) are asserted. 


Entered only if two or three waits are selected . 


Entered only if three waits are selected. 


CSN is negated. 

During 01, the synchronous ready line (SRDYN) is observed to indicate the end of the 
transfer. If SRDYN is high, a wait-state is inserted, and if it is low, the transfer is completed. 
At the end of the transfer, the DSP3210 samples the data bus (D0—D31), and ASN and MRN 
are negated. 
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6.3.4 Write Transaction (1 to 3-or-more waits) 


Awrite transaction configured with 1 to 3-or-more wait-states takes a minimum of three clock periods to complete. 
States $2 and $3 are conditionally needed depending on the number of waits selected. External waits repeat 
State E (see Figures 6-5 to 6-7). 


State SO (90); + The address (A2—31), the memory selects (MSO—MS3), the read/write signal (RW), and the 
page break detect (PBD) are issued. The cycle start strobe (CSN) is asserted. 
State SO (91): | The address strobe (ASN), and the write strobe (MWN) are asserted. 


State $2 (90): — Entered only if two or three waits are selected. If two or three waits are configured, data is 
enabled onto the data bus (D0—D31). 


State S2 (91); 

State $3 (00); Entered only if three waits are selected. 

State $3 (91); 

State E (90): CSN is negated. if one wait is configured, data is enabled onto the external data bus 


(DO—D31). 

State E (91); MWN is negated. During phase 1, the synchronous ready line (SRDYN) is observed to 
indicate the end of the transfer. If SRDYN is high, a wait-state is inserted, and if it is low, the 
transfer is completed. At the end of the transfer, the DSP3210 3-states the data bus 
(DO—D31), and ASN is negated. Note that MWN is negated at the beginning of State E (01) 
even if additional wait-states are inserted. 


6.3.5 Idle State 


An idle bus state is one clock period long. All control strobes, CSN, ASN, MRN, and MWN, are in their negated 
state. The RW signal signifies a read. The PBD retains its state from the previous bus cycle. The address bus 
A2—31 will reflect the address of internal memory accesses if they were initiated in the previous processor state. 
The MSO—MS3 signals will be high. 


6.4 Bus Error 


The DSP3210 bus protocol expects the assertion of SRDYN to terminate a bus cycle. If the bus cycle encounters 
an error condition, external circuitry can: 


—request the DSP to retry the transaction 
—defer the retry by causing the processor to relinquish the bus 
—terminate the transaction and cause the processor to enter an exception handler 


These actions are signaled via the BGN, BERRN, and SRDYN pins. Table 6-1 shows the action taken by the 
DSP for all signal conditions. 


Table 6-1 BGN-BERRN-SRDYN Codes and Results 


BERRN SD i ia ae pe a a] 
[| ____0 __| Retry bus transaction immediatel 
| __0 __| Defer retry, relinquish bus immediatel 
=== oma . 

















Terminate transaction and take bus error exception 
Normal termination 
Insert wait-state. 





[__BGN | 
ee Rae) 
is Sed 
=n Say 
Sa pea ee 
ae ess 
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READ 0 WAITS READ 1 WAITS WRITE 0 WAITS WRITE 1 WAIT IDLE 





Figure 6-4 Bus Protocol (0-or-more wait-states) 


| READ 1 WAIT | WRITE 2 WAITS | IDLE | 
so St E so Ss! E E I 





a aa a eee ee ee am 
(RWN=1) | | 


ee oper SL ee ae) eres eee 8 oe a 


= MRN 








Figure 6-5 Bus Protocol (1-or-more wait-states) 
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| READ 2 WAITS, | WRITE 3 WAITS ] We | 
1) st s2 E so st $2 E E ' 


KR 





Figure 6-6 Bus Protocol (2-or-more wait-states) 


I READ 3 WAITS l WRITE 4 WAITS ; me | 
so si s2 83 e so st s2 83 E E ' 





Figure 6-7 Bus Protocol (3-or-more wait-states) 
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A bus error protocol is signaled to the DSP3210 by the bus error pin, BERRN, in conjunction with SRDYN. The 
two actions that the DSP can be instructed to take are retry or error exception (stop normal processing and vector 
to the bus error exception handler, see 7.3 Exception Processing). A retry bus cycle is initiated in the following 


clock cycle using the same transaction information. Multiple retries can occur if transactions that are retried 
encounter the retry code when completing. Negating BGN while asserting BERRN and SRDYN provides a 


relinquish and retry operation where the retry cycle will commence when the DSP regains the bus. When the bus 


error exception code is detected (BERRN asserted and SRDYN negated), the current bus cycle is terminated, 


outstanding fetches are invalidated, and the processor proceeds to the exception handler. 


Figure 6-8 is a timing diagram of a retry operation. 


A2—A31, 
MSO—MS3 
CSN 


ASN 


MWN 


Do—D31 


SRDYN 


BERRN 


\ READ \ RETRY - READ 
so E So E 


| | | 
XXXXXXXKXXXXXXXA 1 AXXXXXXKXXX [XX 

| | | 
XXXXKRXXXXXKXKK A, AXXXXXKXXXX7 | \XXK 


Figure 6-8 Read Cycle Retry 
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6.5 Bus Arbitration 


The DSP3210 shares the external memory interface bus through a request/acknowledge protocol. Bus arbitration 
is the protocol used by the DSP3210 to become the bus master. An external arbiter controls the arbitration 
process, and the DSP is a slave device and gains access to the bus by requesting ownership of the bus. The bus 
arbiter is responsible for assigning priorities to the devices sharing the bus and granting devices the bus. 


The three signals associated with the bus arbitration are bus request (BRN), bus grant (BGN), and bus grant 
acknowledge (BGACKN). 


BRN Bus Request (Output; Active low; 3-state). When asserted, this signal indicates that the DSP 
needs to perform an external memory transaction. BRN is negated when there are no further 
transactions pending. BRN is asserted even if the DSP has ownership of its bus interface. BRN 
is asserted during a relinquish-retry operation until the retry cycle is commences. BRN is 
asserted throughout an interlocked sequence of bus cycles (see Section 6.6.2 Interlocked Bus 
Cycle Indicator). BRN can be asserted in advance of DMA cycles of the DMA controller (see 
Section 9.2 DMA Controller). 3-stated by ZN. 


BGN Bus Grant (Input; Active low). When asserted, this signal indicates to the DSP that it has been 
granted ownership of the bus When negated, the DSP will relinquish the bus at the completion 
of the current bus transaction. BGN is a synchronous input and setup and hold times with 
respect to CKI must be met for reliable operation. For single bus master systems BGN should 
be tied low so that there is no arbitration overlead. BGN along with BERRN are used to support 
a relinquish and deferred retry bus cycle. 


BGACKN Bus Grant Acknowledge (Output; Active low; 3-state). When asserted, this signal indicates 
that the DSP has acknowledged the bus grant and is driving the external memory interface. 
When negated, it indicates that the DSP external memory interface is in the high-impedance 
state. 3-stated by ZN. 


6.5.1 Bus Arbitration Operation 


When the DSP does not have ownership of the bus, it can execute from its internal memory until accesses to the 
external memory are needed. At that point, the DSP requests ownership of the bus by asserting the BRN. The 
arbiter responds to the bus request by asserting the bus grant pin of the DSP, BGN. The DSP acknowledges the 
grant by asserting the bus grant acknowledge, BGACKN and driving the external memory interface pins. Note 
that when the DSP3210 takes ownership of the bus the first transaction is always signaled as a page break. 
When the DSP has no more pending bus cycles the DSP negates BRN. When the external arbiter negates BGN, 
any on-going external memory transaction is completed before the DSP relinquishes the bus by placing all output 
and biputs associated with the external memory interface bus in the high-impedance state and negating 
BGACKN. The signals affected by bus arbitration are indicated in Table 5-3 Signal Summary. 


Figure 6-9 is a timing diagram illustrating an example of bus arbitration activity. 
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+ The signal is dependent on whether it is a read or awrite cycle. 


Figure 6-9 Bus Arbitration Example 


6.5.2 DSP3210 BRN Functional Timing 


This section contains examples of BRN functional timing, that is the state of BRN for various sequences of 
external memory access sequences. 


1) Instruction Fetch or doblock (one ext. access per instr. cycle): 


State 0 1 2 3 0 1 2 3 0 4 
Address Ext = - a Ext 2 - = Ext S 
Bus Intf. | so E | I so E | i so 





BRN 7 eel 2 asa 


2) Instruction Fetch or doblock (one ext. access per instr. cycle; one prog. wait ): 


State 0 1 2 w 3 0 1 2 Ww 3 
Address Ext = 5 = = Ext - - = = 
Bus Intf. | so $1 E | | so $1 = | 


BRN 
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3) Instruction and Y Fetch (two ext. non-back-to-back accesses per instr. cycle): 


State 0 1 cs 3 0 1 2 3 0 1 
Address Ext * Ext * Ext : Ext = Ext z 
Bus Intf. | so E So E So E so E so 


BRN l 


4) Instruction and X Fetch (two ext. back-to-back accesses per instr. cycle): 


State 0 1 Ww 2 3 () 1 WwW 2 3 
Address Ext Ext : < = Ext Ext * = = 
Bus Intf. | So E so E | so E so E 


BRN l 


5) Four ext. accesses per instr. cycle: 


State 0 1 Ww 2 Ww 3 Ww 0 WwW 1 
Address Ext Ext B Ext 5 Ext = Ext : = 
Bus Intf. | so is so E So E so E so 


BRN 


6) Request/Grant: 


State 0 Ww Ww 1 2 3 Ww 0 Ww 1 
Address Ext = : . E Ext = Ext : 2 
Bus Intf. | B B so E | B So E so 


BRN J 


BGN l Savona Le 


7) Relinquish/Retry: 


State ) Ww w 1 2 w Ww Ww WwW 3 
Address Ext B = : - - - - = 5 
Bus Intf. | B B so E:; BR BR SR E 1 


BRN l 
BGN 
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6.6 Special Interface Capabilities 


The DSP3210 has special features that increase system design flexibility. These features improve memory 
access performance and minimize the number of components necessary when the DSP3210 is being used on an 
asynchronous and/or multiplexed system bus. The features are implemented with the pins PBD/BLMN, LOCKN, 
AEN, DEN, and DLE. AEN, DEN, and DLE operate asynchronously with respect to the rest of the external 
memory interface. Descriptions of these signals is provided in Section 6-1 Bus Interface Signal Descriptions. 


6.6.1 Page Break Detect Indicator 


The page break detect signal can be used to increase system performance in systems using DRAMs or VRAMs. 
Bits 4—7 and 14—15 of the pew register are used to configure the page break detect signal. Bits 4—6 define the 
page size of the memory device. See Figure 6-2 PCW Register for page size settings. Page sizes can be 
selected that correspond to DRAMs from 64K to 1G and VRAMs with a 256 bit shift register. 


Bits 14—15 define the address space for which page break detection is desired. Partition A, Partition B, or the 
entire external memory space can be selected as the region the page break detect is monitoring. Addresses 
outside the selected region are ignored and have no effect on the page break detect signal. This is useful when 
program and data are separated in different physical external memory devices and increased performance can be 
achieved by monitoring accesses to only one of the partitions. The page break pin is asserted when an access to 
the monitored external memory space is not to the same page as the previous access to the monitored external 
memory space. 


Bit 7 is used when the page size is 256 words to enable detection of sequential addressing breaks. This is useful 
when using VRAMs where data must be read/written sequentially in order to maximize performance. If the next 
address is not the next word address, A(N+1) # A(N) + 4, then a sequential addressing break is detected and the 
page break pin is asserted. 


* 
On reset, the page break detect pin is configured to convey BLMN information. This signal will be negated (logic 
high) which also can be viewed as signaling a page break until the pew register is set with the appropriate value. 
The page break is always asserted when the DSP3210 regains bus mastership even if the access is to the same 
page as the previous external access. Similarly, the page break detect is asserted on the retry of a relinquish- 
retry operation. 


6.6.2 Interlocked Bus Cycle Indicator 


The DSP3210 provides a signal, LOCKN, to indicate that it expects to maintain ownership of the bus for one or 
more additional bus transactions. The LOCKN signal is asserted during the execution of a dolock instruction 
sequence. The bus request pin, BRN, is also asserted for the duration of the dolock instruction sequence. This 
can be used to perform secure semaphore updates in shared memory systems by emulating a read-modify-write 
operation within the dolock sequence. These instruction sequences are typically not repeated but the structure 
does allow for it. Interrupts and DMA are disabled during the dolock sequence. 


Up to 128 in-line instructions can be encompassed by the dolock indicators, therefore complex and arbitrary bus 
interlocked sequences can be performed at an overhead of one instruction cycle to execute the dolock instruction. 
The LOCKN indicator is asserted during the Operand Fetch cycle of the instruction pipeline (see Figure 7-1A 
Pipeline for a Single DA Instruction). It is not intended that DA instructions with Z-field writes be used in 
interlocked sequences because the LOCKN indicator will be negated before the write is performed, but four nops 
following the last DAU write will accomodate their use. Note that CA instruction read and write operations are 
always generated during the Operand Fetch cycle. 


In the following example, instructions I1 to 13 are used to perform a test and set operation. Instructions 14 and I5 
are subsequent instructions that cause the processor to branch. 


ih dolock 1,0 /* Assert LOCKN and disable interrupts and DMAs for the 
following two instructions Operand Fetch cycles */ 

12 rm="r2 /* Loading r1 will set the n and z flags for the operand pointed 
to by r2*/ 

3 *r2=13 /* 3 contains the flag value to be written back */ 

4 if(eq) pcgoto czero—/* Flags corresponding to the value in rt can be tested here */ 
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15 nop 


If both instruction and operand accesses were to external memory, the activity and the extent of the assertion of 
the LOCKN pin for the above sequence of instructions would be: 


" 1 Fetch dolock 1,0 

l2 1 Fetch ri="r2 

13 1 Fetch *r2=r3 

Ol 0 Read operand for |2, *r2 
14 0 Fetch 14 

oO (0) Write operand for 13, *r2 
15 1 Fetch I5 


6.6.3 Quad-word Block Move Indicators 


A block move is a programmed transfer of a block of data from one location in memory to another location in 
memory. It is used to move blocks of data from external memory to the DSP3210 internal memory or from the 
DSP3210 internal memory to external memory. The DSP3210 provides two methods of signaling a quad-word 
block move operation. When Motorola signaling is selected (pew[9] = 1), a quad-word transfer is signaled the 
same as a 68040 burst-inhibited line read. The first word of a group of four words to be transferred sets the 
MS3—MS2 pins to 11. The remaining three words to be transferred set the MS3—MS2 pins to 00. Secondly, 
the BLMN pin will be asserted for the first word of the group of four words when selected. The PBD/BLMN pin is 
selected to be BLMN by setting pcw[15—14] = 00. It is the responsibility of the external bus arbiter to grant the 
bus to the DSP3210 for the duration of the quad-word transfer. 


The instruction sequence used to perform block moves is a doblock instruction followed by the DA instruction, 
a0=(Z=Y)*a0, in which the Y operand is written back to memory performing one memory-to-memory transfer 

per instruction cycle. The doblock loop is decomposed into atomic four word transfers that begin with an address 
with A3—A2 = 00 and sequence 01, 10, 11. When the do-loop count is four or more and the address has 

A3—A2 = 00, the start of an atomic quad-word transfer is detected. Interrupts and DMAs are disabled for the next 
three instruction cycles so that the quad-word transfer is not disrupted. Interrupts and DMA cycles are arbitrated 
between quad-word transfers. The address of the first read or write location does not have to be quadword 
aligned, nor does the number of iterations need to be a multiple of 4. Read or writes prior to the detection of a 
quad-word transfer are standard long word transactions. Reads or writes at the end of the loop when fewer than 
four words remain are also standard long word transactions. 


Due to the DAU pipeline, when performing block moves to external memory, at least the last three words and up 
to as many as the last six words will not be able to be made into quad-word transfers. This is because the 
remaining block count is compared to the address during the Operand Fetch cycle of the instruction pipeline (See 
Figure 7-1A Pipeline for a Single DA Instruction). Also, while the last three words are being written subsequent 
instructions can be fetched invalidating the sequential addressing nature of a quad-word transfer. A block move 
to external memory should always be followed by three instructions that do not have Z-writes before a subsequent 
block move is initiated. 


The following doblock loop performs a block move of r3 + 1 words: 


doblock r3 /* Repeat the following instruction r3 + 1 times */ 

aO= (*r2++=*r1++)*a0 7* Note that this instruction is cached on-chip; the r1 pointer is to the 
source, and the r2 pointer is to the destination */ 

3*nop /* Three instructions must be executed between block moves to 


external memory in order to clear the pipe */ 


Figure 6-10 shows the timing for the bus interface signals for a quad-word transfer from external memory to 
internal memory. It consists of four consecutive read cycles. Even though the DSP3210 has a bus bandwidth of 
two words per instructions cycle, when using this type of instruction, it can accept data at one word per instruction 
cycle . Therefore, two idle states are observed between each bus transaction. 
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Figure 6-10 Quad-word Block Move Timing 


6.6.4 Multiplexed Address and Data Bus 


To use the DSP3210 in a multiplexed address and data bus system, AEN and DEN must be used. Basically, an 
external state machine should control when the address is enabled/disabled for read and write operations, and 
control when the data is disabled/enabled for write operations. Figure 6-11 shows an example of the timing for a 
multiplexed address and data bus. AEN and DEN operate asynchronously to the DSP3210 bus protocol. See 
Section 12 Electrical Characteristics for timing specification for AEN and DEN. 

° 
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Figure 6-11 Multiplexing the Address Bus and Data Bus 
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6.6.5 Use of the Data Latch Enable 


The data latch enable pin, DLE, allows read data to be latched by the DLE signal, instead of the DSP3210 clock. 
This simplifies the interface of the DSP3210 to a system bus that is synchronous with respect to the bus clock, but 
asynchronous with respect to the DSP3210 clock. 


DLE is the gate input of a transparent latch that is between the external data bus and the read data latch. When 
DLE is high, the latch is transparent, and when DLE makes a one-to-zero transition, the data bus is latched. The 
read data latch is clocked, as usual, by the falling edge of CKI, and the completion of transactions are signaled by 
SRDYN and BERRN. DLE only affects the latching of read data. When using DLE, data must meet setup and 
hold times with respect to DLE, not CKI. Figure 6-12 shows the timing for a read data transaction where the data 
is latched with DLE. 
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Figure 6-12 DLE Timing 
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7. PROCESSOR STATES 


This chapter describes the modes of operation of the DSP3210 . The DSP3210 is equipped with advanced 
control features that improve performance and simplify system design. This chapter describes the normal 
instruction cycle, wait-states, direct memory access (DMA), exceptions (including reset, errors, and interrupts), 
and wait-for-interrupt (powerdown). 


7.1 Normal Instruction Cycle 


The DSP3210 has an instruction cycle rate of 1/4 the incoming clock frequency (CK). Each instruction cycle is 
divided into four machine states, numbered 0-3, where one state equals one period of CKI. In a single instruction 
cycle, the DSP3210 can perform an instruction fetch, two operand fetches, and a write to memory. The DSP3210 
can execute basically two types of instructions; CA instructions and DA instructions (see Section 4 Instruction 
Set). Each CA instruction can have up to two memory accesses: (1) memory read (I instruction), and (2) memory 
read or write (X-read, Y-write). Each DA multiply-accumulate instruction can have up to four memory accesses: 
(1) memory read (I instruction), (2) memory read (X operand), (3) memory read (¥ operand), and (4) memory write 
(Z operand). Because the DAU is pipelined, the four accesses do not all occur in the same instruction cycle for a 
given instruction. Figure 7-1A shows when address and data are on the internal address bus and Internal data 
bus for one DA instruction. Figure 7-1B illustrates a full pipe of sequential DA multiply-accumulate instructions. 
The subscripts refer to instruction numbers; instructions are fetched and executed in ascending order. For 
example, Xo is the X operand for instruction number 0, and X-1 is the X operand for the instruction previous to 
instruction number 0. 
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Figure 7-1B Full Pipeline for Multiple DA Instructions 
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7.2 Wait-states 


The number of states that it takes for an instruction to complete is affected by the introduction of wait-states. 
Wait-states are introduced when resources required by the instruction are busy and the processor must wait in 
order to continue processing. There are two types of wait-states: conflict wait-states and external wait-states. 
Conflict wait-states arise when back-to-back memory requests are made to the same physical memory and the 
physical memory is either the Boot ROM/MMIO or the external memory. Conflict wait-states do not occur when 
accessing on-chip RAM, RAMO or RAM1. External wait-states arise when the number of states required to 
access data in external memory is greater than two or when the DSP3210 is requesting an external memory 
transaction and it does not have ownership of its bus interface. 


The DSP3210 has three physical memories; RAMO/RAM1, External Memory A/B and Boot ROM/MMIO. Some 
physical memories can introduce conflict wait-states because it takes two states to read or write them and 
addresses are issued every state. The two physical memories that have this property are the External Memory 
AJB and the Boot ROM/MMIO. The entire external memory space (Partition A and Partition B) is considered one 
physical memory, and the combined Boot ROM and Memory-mapped I/O space is considered one physical 
memory. Wait-states are invisible to the programmer except for their effect of decreasing performance in the 
DSP3210. However, throughput is degraded only incrementally by the insertion of a conflict wait-state because it 
is only 1/4 of an instruction cycle. 


If external memory is slower than that allowed for zero wait-state operation (2 states), external wait-states are 
inserted. Also, if the external memory interface is not granted to the processor when an external memory access 
is required, wait-states are inserted until the bus is granted. Note again that throughput is degraded only 
incrementally by the insertion of external wait-states because each is only 1/4 of an instruction cycle. 


Given the pipelined nature of this device, it is not immediately obvious what successive memory accesses are. 
Figure 7-1B shows that several DA instructions may be in various stages of execution during a single instruction 
cycle. To achieve full throughput, no two consecutive accesses can be made to the same physical memory that 
incurs wait-states. Thus, for example, if the X and Y operands for a given instruction are fetched from the external 
memory, one conflict wait-state will be inserted between cycle 0-state 3 and cycle 1-state 0. Similarly, if the result 
for instruction I is written to external memory and instruction i + 4 is fetched from external memory, one conflict 
wait-state is inserted between cycle 1-state 1, and cycle 1-state 2. 


Figure 7-2 shows the effect of conflict and external wait-states on the instruction cycle. In this example, the 
pipeline is full of DA instructions and all memory accesses are being directed to external memory. Instructions 
are in logical partition A with zero wait-states, and data is in logical partition B with one wait-state. Conflict wait- 
states are denoted as Wc, and external wait-states are denoted as We. Even though there are wait-states, the 
entire bandwidth of the external memory port is utilized. 


Address Bus 


External 
Data Bus 





Figure 7-2 Instruction Cycle with Conflict and External Wait-states 


7.3 Direct Memory Access 


External devices can access the on-chip memory in the DSP3210, as well as external memory, using direct- 
memory access (DMA). DMA transfers occur between the memory and the serial I/O port without processor 
intervention, i.e., using cycle-stealing where the DSP3210 waits for one instruction cycle while the DMA transfer 
occurs. Two on-chip DMA controllers support memory access via the serial input port and the serial output port. 
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When DMA is enabled, serial I/O (SIO) transfers occur between the input buffer (ibuf) register and memory and/or 
between memory and the output buffer (obuf) register without program intervention. The DMA control (dmac) 
register enables and configures serial /O DMA activity (see 9.2 DMA Controller). The memory address pointers 
and the number of transfers to perform for input and output DMA are contained in the input pointer (idp), output 
pointer (odp), input transfer count (icnt), and output transfer count (ocnt) registers. The serial DMA facility is 
useful for multichannel applications so that input and output buffers for an entire frame of samples can be 
established in memory. 


Figure 7-3 shows how instruction processing is suspended by the insertion of a DMA cycle. The row labelled IR 
represents the Instruction Register. 
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Figure 7-3 DMA Cycle Timing 


7.4 Wait-for-Interrupt Mode 


The DSP3210 provides a wait-for-interrupt mode so that the power dissipation of the device is reduced when it is 
not performing data processing. The wait-for-interrupt mode is enabled by the execution of the instruction waitl. 
This instruction will cause the processor to enter continuous DMA-like cycles. If requested, DMA transfers are 
performed. When any unmasked interrupt exception is detected, the processor exits the wait-for-interrupt mode 
and executes the instruction following the waiti instruction before branching to the appropriate interrupt routine 
(interrupt processing may be initiated only after executing an instruction, not after a DMA cycle). Also, the wait- 
for-interrupt mode is terminated if an error exception is detected. 3 


7.5 Instruction Sequencing Exceptions 


The DSP3210 has four distinct instruction processing levels; Base level, Interrupt level, Error level, and Double 
Error level. Movement between any level is considered as an exception process. Base level instruction 
processing can be altered by the introduction of interrupt routines. Base level or Interrupt level instruction 
processing can be altered by the introduction of error handling routines. Error handling can be altered by the 
introduction of a double error. Figure 7-4 shows the processing levels in the DSP3210. 


The DSP3210 handles interrupts using a continuation exception processing model. Single error conditions are 
handled using an abort exception processing model. Double error conditions are handled by stopping the 
DSP3210. 


The continuation exception model means that the necessary processor context is saved so that when exception 
processing is completed, the processor can resume at the point where it left off. The abort exception model 
means that certain processor context is cleared and the current instruction cycle is aborted. The processor can 
not resume processing where it left off. Register information to locate the cause of the error is maintained so that 
debugging can be done (refer to Table 7-3). When a double error condition is recognized, the DSP3210 aborts 
the current instruction, stops, and signals the external system. At this point, the DSP3210 must be reset to start 
instruction processing. 
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Interrupt exception, DMA, and instruction processing decisions are accepted at each instruction cycle boundary. 
In fact, arbitration is done during state 1 in order to decide what to do in the following instruction cycle that starts 
in state 0. Also, interrupt exception, DMA, or instruction processing decisions are based on the highest priority 
activity concurrently being requested. Interrupt exceptions are the highest priority, and instruction processing is 
the lowest. If no interrupt exceptions are requested, then the processor will check to see if a DMA cycle is 
requested. If no DMA cycle is requested, then a normal instruction cycle is performed. When an interrupt 
condition is recognized, the DSP3210 saves the processor context in interrupt shadow registers, and the 
processor branches to the appropriate interrupt handling routine based on the source of the interrupt. When the 
interrupt routine executes an ireturn instruction, normal instruction processing is reinstated by using the context 
saved in the interrupt shadow registers. 


Error exceptions are arbitrated on a state-by-state basis. The detection of an error condition will cause the 
processor to abort the instruction at its current point of execution; processor level and do-loop control information 
are reset, and DMA and interrupts are disabled. Table 7-1 shows the priority of exception processing. Exceptions 
have a fixed priority, but masking can be used to selectively enable desired exceptions. Error and interrupt 
masking is done in the exception mask register (emr). Instructions that lock out interrupts will not respond to an 
interrupt for the duration of the lock. Error exceptions can not be locked out, although some exceptions can be 
masked. When an error condition is recognized, the current instruction is aborted and the processor branches to 
the appropriate error debug handler based on the source of the error. 


The DSP3210 supports a 16-entry exception vector table. Each entry is a pair of 32-bit words starting at the 
location specified in the exception vector table pointer (evtp) register, 122. On reset, the evtp register is cleared to 
zero. To branch to the exception handling routine, the processor multiplies the vector number by eight to 
determine the offset. It adds the offset to the address in the exception vector table pointer (evtp) to obtain the 
memory address of the exception vector. A branch instruction using this address is executed followed by a nop. 
Figure 7-5 shows the layout of the exception vector table. 


Only the initial reset vector is fixed in the DSP3210's memory map (evtp = address 0 on reset). When modifying 
the location of the vector table, the new table entries must be established starting at the new base address. Then, 
the evtp pointer is changed to point to the new base address, and then interrupts are enabled in the emr register. 
The layout of the emr is shown in Table 7-2. 
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Table 7-1 Priority of Exception Processing 
Maskable | ___—Description 
[Errors CSCSC—~‘TCSCCSCSCSCC AXbortt exception model 
[Reset SS] ~SNo_ | Highest priority error condition 
[ Busierior sa 2 oes | ve sNoW | 
Miegal opcode [No] 
[Reserved | NO 
‘Address Error 
DAU Overflow/Underflow 
Lowest priority error condition 
[interrupts S| ~—SSS_—=| Continuation exception model 
External Interrupt 0 Highest priority interrupt condition 
SIO Input Buffer 
SIO Output Buffer 
SIO DMA Input Frame’ 
SIO DMA Output Frame 
External Interrupt 1 Lowest priority interrupt condition 
VECTOR ADDRESS 32-BITS EXCEPTION SOURCE 
NUMBER ! ! 
0 evip +0 RESET 
a +8 BUS ERROR 
2 +16 ILLEGAL OPCODE 
ERROR 
24 
3 + RESERVED EXCEPTION 
4 +32 ADDRESS ERROR 
5 +40 DAU OVERFLOW/UNDERFLOW 
6 +48 NAN 
slit sat pec SOO =e  SRESE RVED) peg ee eres Shee 
8 +64 EXTERNAL INTERRUPT 0 
9 +72 TIMER 
10 +80 RESERVED 
"1 +88 SIO INPUT BUFFER INTERRUPT 
12 +96 SIO OUTPUT BUFFER EXCEPTION 
13 +104 SIO DMA INPUT FRAME 
14 +112 SIO DMA OUTPUT FRAME 
15 +120 EXTERNAL INTERRUPT 1 





Figure 7-5 Exception Vector Table 
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Table 7-2 emr Register Encoding 


Bit 15 | 14 12 jo| 9 S.-|.17: 
Field OFRM | OBE | [RES | TIMER | EXxTO [RES | NAN] 


Each bit in this register corresponds to one of the maskable sources for an exception. A value of 1 in a position 
enables the corresponding interrupt source; a value of 0 disables the source, On reset all bits are set to 0 (zero). 





If 1, select processor level to be visible 
on IACK[1—0]. Encoding: 
00 - Base level 
01 - Interrupt level 
10 - Error level 
11 - Double error level 
Reserved 
Addressing Error Highest 
DAU Underflow/Overflow | 
IEEE NAN | 
Reserved 
Interrupt pin 0 (IRON) request 
TIMER counted to zero 
Reserved 
SIO input buffer full 
SIO output buffer empty 
SIO input DMA frame complete 
SIO output DMA frame complete 
Interrupt pin 1 (IR1N) request 





7.5.1 Interrupt Exceptions 


The DSP3210 provides a single-level interrupt facility and can respond to eight sources, five internal, two external, 
and one reserved. The interrupts are prioritized and are individually maskable via the exception mask register, 
emr. The emr is a 16-bit register with one maskbit for each maskable exception source (see Figure 7-5). 


Interrupt exceptions are handled using a continuation exception model. This means that when an interrupt is 
recognized, the DSP3210 will maintain a copy of all user-invisible state information necessary to gracefully break 
the execution pipeline, execute a program in response to the interrupt request, and then restore the previous 
pipeline, without having to restart the instruction. A shadow image of relevant state and control information is 
stored in shadow registers. All shadow registers are invisible to the programmer's model except for the pcsh, the 
program counter shadow register. The pesh register may be examined by the programmer (when examined 
during the interrupt routine it points to the memory location to be fetched when exiting the interrupt routine; when 
examined during base-level processing it contains the same value as the pc). 


The first step of interrupt exception processing is the recognition of the interrupt. When one-or-more unmasked 
interrupts are present, the highest priority interrupt is recognized. The following set of events happen: 


1) the DSP3210 automatically saves the state of the machine that is invisible to the programmer (do loop status 
information, ...), as well as, the DAU accumulators, a0—a3 (including guard bits), and the dauc register in 
interrupt shadow registers, 

2) the program counter value for the instruction to be fetched after returning from the interrupt is saved in the 
pcesh register, 

3) interrupts are disabled. 
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The second step of interrupt exception processing is branching to the appropriate interrupt handling routine. To 
do this the following set of events happen: 


1) the exception vector is multiplied by 8 to obtain the table offset, 

2) the offset is added to the address in the exception vector table pointer by the instruction goto evtp + offset to 
obtain the memory address of the exception entry and the branch is executed, 

3) the goto is followed by a nop to account for branch latency. 


Internal states that are visible to the programmer (other than the DAU accumulators, a0—a3) and used in the 
interrupt level must be saved and restored by the interrupt service routine. 


The last step of interrupt exception processing is returning to the base processing level. This is done by 
execution of the ireturn instruction. The ireturn instruction: 


1) restores the state of the machine that is not visible to the user, as well as a0—a3 and the dauc register, 
2) fetches the instruction pointed to by the pcsh register, 

3) causes the next instruction to be executed to come from the instruction shadow register (irsh), 

4) re-enables interrupts. 


Three instructions are not interruptible: 

—CAU register load from memory or V/O register, or register load from memory 
e.g. rD = *rS or rD =iori or iort = *rs 

—Branch instructions, including goto, conditional goto, call and return 
e.g. goto rN 

—Stack pointer increment/decrement, 
€.g. Sp = (long) sp++ 


Figure 7-6 shows an example of the interrupt exception process. The row labelled IRSH represents the 
Instruction Shadow Register. 








goto evip +N 
Keo FF a SB NE SA 
Address Yo Zz 
Bus 










[DataBus| to] | | [Z-s[ xo | Yo] _| 
[iA [ [tof to] to | gotoevip+p | 
Desh [eff opie fT Pi paf | 


Interrupt Arbitration and Prioritization 
Interrupt Request Synchronization 


Figure 7-6 Interrupt Exception Timing 
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7.5.1.1 Quick Interrupt 


Each entry in the exception vector table may contain instructions. In general, these may be used for a goto and 
the delayed branch latent instruction. Alternatively, by using these two locations (and any entry pair following this 
that are not being used) for instructions, an interrupt routine with an overhead of three instruction cycles can be 
performed. An example of a quick interrupt operation performed by the DSP3210 while processing the instruction 
stream 1, 12, and I3 is shown below: 


" /* Base level instruction #1 */ 

l2 /* Base level instruction #2 

goto evtp + offset 7 overhead branching to interrupt 

nop /* latent branch instruction 

IR1 /* useful interrupt routine instruction 

ireturn /* no latent instruction executed after ireturn */ 
13 /* Base level instruction #3 */ 


7.5.2 Error Exceptions 


Error exceptions are handled using an abort exception model. The errors that can cause an error exception are a 
reset (power-on or RESTN asserted), bus error, illegal opcode, address error, DAU overflow or underflow, or the 
detection of a NAN on an IEEE to DSP32 format conversion. Error exception processing can be entered from the 
base level or from the interrupt level. 


The first step of error exception processing is the recognition of the error. When one-or-more non-maskable 
errors are present or maskable errors are present and unmasked, the highest priority error is recognized. The 
following set of events happen: 


1) the instruction currently executing is aborted 
a) do-loops are terminated if active, s 
b) pending memory transactions are cancelled, 
2) on-going memory transactions are completed (but the data is ignored), 
3) DMA operations are disabled (wait-for-interrupt is accordingly terminated), and 
4) interrupts are disabled. 


The second step of error exception processing is to branch to the appropriate error handling routine. To branch to 
the exception handling routine: 


14) the processor multiplies the exception vector number by eight to determine the offset, 

2) the offset is added to the address in the exception vector table pointer (evtp) by the instruction call evtp + 
offset (r20) to obtain the memory address of the exception entry and the call is executed, 

3) the pc is placed in r20, the error trace register, as a result of the call (the pc represents the address of the 
prefetched instruction), 

4) the call instruction is followed by a nop to account for branch latency, 


Figure 7-7 shows an example of the error exception process. In this example, an error is detected during state 1 
of lo. lo is aborted and the processor state is forced to state 3 for three clock cycles. This is referred to as the 
error state. Following the error state, the processor executes the call and nop instructions that vector it to the 
error handling routine. Table 7-3 shows the registers affected by error exceptions other than reset. Note that on 
a reset error exception, additional registers are cleared as described in 7.5.4 Reset. 


The cause of the error exception is broken into two types; non-maskable errors, and maskable errors. When an 
error exception is taken, the type of error is recorded so that if subsequent errors occur, the processor can 
respond to them. If the processor is in the error level caused by a maskable error and another maskable error is 
detected, the error is ignored. If the processor is in the error level caused by a maskable error and a non- 
maskable error is detected, an error exception is taken to the non-maskable error source handler routine. If the 
processor is in the error level caused by a non-maskable error and another non-maskable error occurs, the 
processor executes a double error exception (see 7.5.3 Double Error Exceptions). 
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The error level processing state can be exited by a software reset of the processor. A software reset of the 
processor consists of 1) configuring control and data registers to their reset state or any other desired state, 2) 
resetting the error level to the base level via the sftrst instruction, and 3) branching to 0 or any other desired 
location. The sole function of the sftrst instruction is to change the processor level to the base level. Steps 1 and 
3 are performed in software. 


7.5.2.1 Double Error Exceptions 


If during error exception processing of a maskable error or non-maskable error another maskable error is 
detected, the processor will ignore it. If during error exception processing of a maskable error a non-maskable 
error is detected, the processor will initiate another error exception. If during error exception processing of a non- 
maskable error another non-maskable error is detected, the processor will enter a double error exception. A 
double error exception will cause the processor to enter a state that can only be left by asserting the RESTN pin 
or powering down and powering up the DSP3210. The double error exception is signalled by the simultaneous 
assertion of IACKO and IACK1. 


Table 7-3 Registers Affected by Error Exceptions Other Than Reset 


[Register [Value [Comments 
120 Previous PC | Result of call evip + VN8 (r20).” 
pe evip + EN+8 | Result of call evtp + VN*8 (r20). 

tcon[0] (0) Timer decrementing is disabled. 
dmac[0,4} () SIO DMA is disabled. 


[pewli3}_ [0 Enable writing of pew register. 


* Nis the exception vector number. 
. 

[10 | Error State | callevtp+N(20) [nop 

Address Bus |_| tr | Es 


[Dstacus [o} || In] || 1111 [1 Je] 
lime | [wll { | [| [ catewp+nveq [nop 


oo Error Exception Detected 


Figure 7-7 Example of Error Exception Process 
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7.5.3 Description of Error Exception Sources 


The next sections describe each error exception in detail. Reset is a special case of the error exception and is 
discussed in Section 7.5.4 Reset. 


7.5.3.1 Bus Error 


A bus error occurs when an external memory transaction is terminated with BERRN asserted and SRDYN 
deasserted. When this code is detected, the current bus cycle is terminated and outstanding fetches are 
invalidated (see 6.4 Bus Error). 


7.5.3.2 Illegal Opcode 
The opcode of the instruction is defined as the upper 6-bits of the instruction (31—26). Seven of the 64 possible 


opcodes are defined to be illegal. 
[Bits 31—26 [Bits 31—26__| 
a= SSS 


06000000 0x00 
06000001 
0b000010 
0b001111 
0b010110 
06010111 
0b100010 










7.5.3.3 Address Error . 


An address error occurs when a 32-bit word is addressed with an address not divisible by 4, or a 16-bit half-word 
is addressed with an address not divisible by 2. 


7.5.3.4 DAU Underflow/Overflow - 


Overflow occurs if a DAU operation results in a number that exceeds the range of positive or negative numbers 
that are representable in the 40-bit format. An overflow occurs if the result is more positive than 2 — 2731 + 2127 of 
more negative than -2 + 2! 27_ Underflow occurs if a DAU operation results in a number that is smaller than the 
range of positive or negative numbers that are representable in the 40-bit format. An underflow occurs if the non- 
zero result is more negative than 1 + 2-127 and more positive than —[1 + 2°31}.2-127, 


7.5.3.5 NAN 


A NAN is detected in the IEEE to DSP32 floating-point conversion operation if the IEEE number is a NAN (e = 255 
and f # 0 ands = don't care). P 
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7.5.4 Reset 


The DSP3210 provides two reset mechanisms, a power-on reset circuit and an external reset pin, RESTN. The 
following terminology is needed to define reset operation: 


Power-on reset: The power-on reset circuit is active for the first 8 rising clock edges after power has been 
applied (VOD 2 3 V) to the device and it puts the chip in the reset state. After 8 clock cycles, 
the logic level of RESTN controls DSP3210 operation. If RESTN is low, the DSP3210 
remains in the reset state. If RESTN is high, the DSP3210 performs the reset sequence, and 
enters the base level processing state. 


Reset state: The DSP3210 is in the reset state when the internal reset signal is asserted, initializing the 
internal states of the chip. In the reset state, external memory transactions, bus requests, 
wait-for-interrupt, and error or interrupt exception processing are immediately terminated. 
Internal registers that are affected during reset are shown in Table 7-4. This state is entered 
when either power-on reset is detected, or RESTN is asserted. 


Reset Sequence: The reset sequence is the same as step two of the error exception process (see 7.5.1 Error 
Exceptions), and it consists of executing two internally generated instructions: 


(1) call evtp + 0 (r20) 
(2) nop. 


Table 7-4 Registers Affected by Reset 


[—___evp | __0____| Exception base address setto0 | 
[pe] 0 ___| Resuitofcallevip + 0(20) 
[——“dauopr—o] [| 0 | Roundete. SCS 















[pew] | __biof6] | Seto logic state applied to biol6] | 
[—pewi2] | _bio{5] | Seto logic state applied to bio(5}___| 
[___ em{i5—0] | 0 | Al maskabie exceptions masked | 
[——tconf7—0] | __0 __| Timer disabled 

[__iocjai—o] | 0 | Passive mode, 32-bit data lengih, eto. 
[__émaof7—oy | 0 |SIODMAdsabied 
| biooy7—o] | 0 | BiO pins configured as inputs] 






bio[1,3,5,7,9,11,13,15) | 0 _| BlO output register bits 
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7.5.4.1 Reset Timing 


A description of the reset pin follows: 


RESTN Reset (Open Drain; Active Low). The RESTN pin is an open-drain signal. 
When the power-on reset circuit is active, the DSP3210 asserts RESTN by 
pulling it down. The external system may also pull RESTN pin low at the same 
time or at other times to cause the DSP3210 to enter the reset state. RESTN is 
an asynchronous input that is internally synchronized to the falling edge of CKI. 
On a zero-to-one transition of RESTN, the reset sequence is initiated. The 
rising edge of RESTN is used to latch the signal values on pins bio[7—4] into 
pew[10—13], respectively. 


On power-on of the DSP3210, on-chip circuitry puts the device in the reset state for the first 8 clock cycles (the 
first 8 rising clock edges of CKI). At this time, the logic level of RESTN controls the DSP3210's operation. Figure 
7-9a shows the timing for power-on reset. Note that all bus interface signals are placed in the high-impedance 
state during the reset state and for the first 11 clocks periods of the reset sequence. Outputs and Biputs, other 
than the BIO and the Bus Interface signals, are placed in the high-impedance state while the power-on circuit is 
active only. No RESTN pulse is required to reset and start the DSP3210 running when using the power-on 
capability. The RESTN pin must be pulled to Vop through an external resistor. 


An externally controlled reset can be initiated at any time. RESTN is an asynchronous input that is internally 
synchronized to the falling edge of CKI. Figure 7-9b shows the timing for externally controlled reset operations. 
The minimum pulse width for RESTN is 4 periods of CKI. All BIO pins are immediately put in the high-impedance 
state so that signal values may be applied on the rising edge of RESTN. Outputs and Biputs other than the BIO 
and the Bus Interface signals may be placed in the high-impedance state by asserting ZN. 


For both power-on reset and externally controlled reset, the BIO[4—7] inputs must be valid prior to the rising edge 
of RESTN and held following it in order to guarantee correct operation and to initialize pew[13—10]. The rise time 
of RESTN between 0.4 V and 2.4 V must be less than 4.ns. With up to 50 pF of external load, a 1K Q resistor can 
be used. 


cKI 


Vop 


RESTN 


ZN 


BIO[0—7] 





BUS INTERFACE 
SIGNALS 


OUTPUTS & 
BIPUTS 


Figure 7-8A Power-On Initiates the Reset Sequence 


TAZ 





DSP3210 Information Manual 
Processor States 










ae 
- ae ee? 
ss =H Se eee 





BUS INTERFACE 
SIGNALS 


Figure 7-8B RESTN Initiates the Reset Sequence 
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8. DATA PROCESSING FUNCTIONAL UNITS 


The DSP3210 provides two data processing units; Control Arithmetic Unit (CAU) and Data Arithmetic Unit (DAU), 
The CAU performs integer arithmetic, and the DAU performs floating-point arithmetic and data type conversions. 
Detailed descriptions of these data processing units are contained in this section. 


8.1 Control Arithmetic Unit (CAU) 


The CAU is responsible for performing address calculations, branching control, and 16- or 32-bit integer arithmetic 
and logic operations. It is a RISC-like processor consisting of a 32-bit arithmetic logic unit (ALU), 32-bit Barrel 
Shifter, and a 32-bit wide register stack. 


8.1.1 CAU Programming Model 


The CAU is used in the execution of both classes of instructions: CA (Control Arithmetic) and DA (Data 
Arithmetic). CA instructions perform load/store, branching control, and 16- and 32-bit integer arithmetic and 
logical operations. CA instructions are performed solely by the CAU. DA instructions use the DAU to perform 
floating-point multiply/accumulate and special function operations and the CAU to generate addresses for the 
operands. Each DA instructions can have up to three operand memory accesses per instruction, and the CAU is 
responsible for generating the addresses for these operands. For a detailed description of the instruction set, see 
Chapter 4 Instruction Set. 


CAU REGISTERS 





31 15 7 ° 
rd ri5 
ri ri6 
12 17 
3 18 
r4 19 
i 120 
r6 
a 121 (sp) 
8 122 (evtp) 
r10 
aa Key: 
2 Coy interrupt sHavow 
13 . 
oF EJ RESERVED 
CONTROL REGISTERS 
31 15 7 0 


a Bd 


Figure 8-1 CAU Programming Model 
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Table 8-1 ps Register 


Bit, [eas [ia] 13. | ae | cian tonite ae aren p sansa) 2s lee 
Field [oO | 0 | in | wo | to | sy [oel w@ fv Tut z{n{~etv{z{o | 


CAU n flag; means that the ALU result is negative. 

CAU z flag; means that the ALU result is zero. 

CAU v flag; means that the ALU result overflowed. 

CAU ¢ flag; means that the ALU result had a carry out of the MSB. 
DAU N flag; means that the DAU result is negative. 

DAU Z flag; means that the DAU result is zero. 

DAU U flag; means that the DAU result underflowed. 

DAU V flag; means that the DAU result overflowed. 

Serial input buffer full flag; means that the serial input buffer is full. 
Serial output buffer empty flag; means that the serial output buffer is empty. 
Serial sync set flag; means that the SY pin is high (logic one). 

Serial frame boundary flag; means that the SIO is at a frame boundary. 
Pin IRON is high (logic one). 

Pin IR1N is high (logic one). 

Reserved - read as zero. 





The CAU register programming model consists of 24 CAU registers and one IO register (see Figure 8-1). The 
CAU register set consists of r0 (hard-wired to zero), ri—r22 (32-bit registers), pc (program counter)/pesh 
(program counter shadow). The IO register is the ps (processor status). The ps register is a 16-bit register and 
contains CAU flags, DAU flags, and IO flags. The CAU flags are read/writable. The DAU and IO flags are read- 
only. The position, encoding, and functional definition of the bits in the ps register are available in Table 8-1. 


Registers ri—r14 are used as general-purpose registers in CA instructions and as memory pointers (rP) in DA 
instructions. When used as memory pointers, ri—r14 hold 32-bit addresses. Registers ri5—r19 are used as 
general-purpose registers in CA instructions and as increment registers (rl) in DA instructions. When used as 
increment registers, r15—r19 hold 32-bit 2's complement values that can post-modify the memory pointers. 
Registers r0 , r20, r21, and r22 have special properties and can only be used in CA instructions. Register r0 is 
hard-wired to zero. Register r20 is the error trace register and it is used to hold the program counter address at 
which an error exception was taken. Register r21 is the stack pointer. Register r22 is the exception vector table 
pointer. The use of CAU registers in CA and DA instructions is shown in Figure 8-2. 













Instruction Type 


Purpose 








[x |r ace 








Figure 8-2 CAU Register Usage 
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8.1.2 CAU Functional Description 


Figure 8-3 illustrates the hardware architecture of the CAU. It consists of two sections, the CAU control section 
and the CAU data path. The CAU control section is responsible for the decoding of the instructions. It takes as 
inputs the instruction from the instruction register (IR), the status of the machine from the ps register, and the K 
count (used to hold the number of instructions to be repeated in a do loop). The CAU data path is responsible for 
executing the integer and address calculation operations of CA and DA instructions. It consists of a user- 
accessible register array, an ALU, a barrel shifter, the L and SA registers (used to store the iteration count and 
start address for do loops, respectively), and the Z-address delay line (used to delay the Z address for DA 
instructions). 

The arithmetic or logical operations provided by the ALU or Barrel Shifter are summarized in Table 8-2. All CA 
arithmetic instructions are performed on 32-bit 2's complement, integer operands. If flags are affected by an 
instruction, they are calculated based on 16-bit or 32-bit operation flag rules, depending on the size of the 
operation as specified in the instruction. For example, when performing operations using 16-bit arithmetic, (1) 16- 
bit data is loaded into the lower 16-bits of the 32-bit CAU register(s), with the most significant bit of the integer 
extended into the upper 16 bits; (2) 32-bit arithmetic is performed, with flags computed according to 16-bit 
operation flag rules; and (3) the results can be stored in memory by writing the lower 16-bits of the register to a 
16-bit memory location. A description of CAU flags is available in Section 4.1 Flags. 


Table 8-2 CAU Operations 


Shift 16-bit immediate left 16 binary places and OR 
Add 

Subtract 

Logical AND 

Logical OR 

Logical Exclusive OR 
Carry Reverse Add 

Logical Shift Left 

Logical Shift Right 
Arithmetic Shift Right 
Arithmetic right shift by 1 
Multiply by 2 

Rotate Left through Carry 
Rotate Right through Carry 
Increment/Decrement 
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CAU DATA PATH 







ADDRESS 
BUS (32) 


ncwWUrA>ro 


DAU {I/ CAU FLAG| 
LAGS] FLAGS] STATE 
ee! 


LEGEND: 

IR Instruction Register 0 Zero 

K K counter register nm—ri4 CAU general-purpose register, 
L L counter register DAU pointer register 

SA Start Address 5—1r19 CAU general-purpose register, 
IB Increment Bus DAU increment register 


UB Update Bus 120 CAU general-purpose, Error trace 
PB Pointer Bus sp (21) Stack pointer 

evtp(r22) Exception vector table pointer 

ps Processor Status 


Registers in upper case are not user accessible. [{y Shadowed during interrupts 





Figure 8-3 Block Diagram of the CAU 
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8.1.3 CAU Internal Operation 


The internal operation of the CAU is best understood by describing the operation of its instruction set. The CA 
instructions consist of three types: arithmetic/logical, data move, and control instructions. Arithmetic/logical 
instructions are performed as register to register operations. Many of the CA arithmetic instructions can be 
performed using three different registers: two source registers (rS1, rS2) and one destination register (rD). CA 
instructions may also be conditionally executed based on flags established in previous instruction cycles. Data 
move instructions specify 8-, 16-, or 32-bit data transfers between CAU registers and memory, CAU register and 
V/O registers, and /O registers and memory. Control instructions alter the program flow by modifying the pc 
(program counter). 


Figure 8-4 illustrates the CAU pipeline for executing two instructions of the form: 


mD-rS 
if(eq) rD = rS1 + rS2 


The first instruction subtracts the contents of the source register (rS) from the destination register (rD), to set/clear 
the CAU zero (z) flag. The second instruction checks the z flag, and if it is set (j.e., the result of the previous 
instruction was zero), adds the contents of rS1 and rS2, and stores the result in rD. If the flag is not set, the 
contents of rD are unaffected by this instruction. 


During execution of DA instructions, the CAU generates up to four addresses, one address in each of the four 
states of an instruction cycle. In each state, the CAU can add the contents of two registers: a pointer selected 
from ri—r14 (placed on the pointer bus) and an increment selected from r15—r19 or a fixed increment value 
(placed on the increment bus). The CAU employs a 3-stage pipeline to (1) fetch from a register(s), (2) operate on 
the fetched operand(s), and (3) store the result in a register. Because of this pipelining, the preceding pointer is 
updated (result placed on the update bus), while the next pointer and increment are being accessed. 


Figure 8-5 shows the CAU pipeline for executing a DA instruction of the form: 
Z=aN=aM+Y*X 


Since the result of the multiply-accumulate operation is not available on the data bus until state 0 of instruction 
cycle Is and the destination address is calculated during state 1 of instruction cycle lz, the address is latched and 
delayed for three instruction cycles before being placed on the address bus. This three instruction cycle delay is 
performed by the Z-address delay block shown in Figure 8-3. 


Except for the register load from memory and branch operations that have a latency of one instruction cycle, 
execution of a CA instruction is completed by the CAU before the next CA instruction begins execution. This 
simplifies use of the CAU for logic and control operations. 
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Pointer Bus 





Update Bus 
Address Bus 





For instructions in the form of: 
Ip: = rD-rS 
ly: if(eq) rD =rS1 + rS2 
where rD= destination register 
rS,rS1,rS2= source register 
CAU condition equal to zero (z flag) 
modification of register and flag is conditional 
program counter 
additional operation 
subtraction operation 


1+ 808 


Figure 8-4 CAU Internal Pipeline Timing - CA Instruction 





For instructions in the form of: 
Ip: *rPz++rlz = aN = aM + vende *rPytttly 


where rPz, rPy, rP,= pointer register for DAU - Z, Y, and X operands 
lz, tly, tly = increment register for DAU - Z, Y, X operands 


Pc= program counter 
+= additional operation 


Figure 8-5 CAU Internal Pipeline Timing - DA Instruction 
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8.2 Data Arithmetic Unit (DAU) 


The DAU is the primary execution unit for signal processing algorithms. This unit contains a 32-bit floating-point 
multiplier, a 40-bit floating-point adder, four 40-bit accumulators, Data Type conversion hardware, and two user- 
accessible registers, dauc (DAU control register) and ctr (Clip-Test register). The multiplier and adder work in 
parallel to perform 16.7 million computations per second of the form (a=b +c *d). The multiplier and adder each 
produce one result per instruction cycle. The DAU contains a four stage pipeline: Fetch, Multiply, Add, and Store. 
Thus, in any instruction cycle, the DAU may be processing four instructions, each in a different stage of execution. 


8.2.1 DAU Programming Model 


The register programming model for the DAU consists of 4 DAU registers, called accumulators, and two ie) 
registers, the dauc, DAU control register and the ctr, clip-test register. Figure 8-6 shows the register programming 
model for the DAU. Note that all registers in the DAU are shadowed during interrupts. Each accumulator is a 
40-bit register consisting of a 32-bit mantissa and an 8-bit exponent. 


The dauc is used to select the 8-bit data type conversion performed for the instructions Ic and oc, the truncation 
mode to be used when converting floating-point data to integer, and how writes are handled in conditional DA 
instructions, aeq, alt, and agt. Linear byte conversion, \1-law, and A-law share the same instruction encoding 
(Z=aN=ic(Y);) and are therefore distinguished only by the dauc setting. Choosing linear byte input and/or output 
mode overrides the corresponding j-law/A-law setting. Loading the dauc register from memory affects the 
following instruction. The meaning for the settings of the dauc register are given in Table 8-3. The dauc register 
is cleared on reset. 

The ctr stores the last six values for the DAU N flag. It is used for 2-D and 3-D clip-test algorithms to compute the 
outcode. This outcode can then be read into a CAU register for further operations. The meaning for each bit of 
the ctr register is given in Table 8-4. 


DAU REGISTERS CONTROL REGISTERS 
39 15 7 0 


31 
a0 [mantissa] guard | exp | iorl4 (dauc) [XK ——_ 
¢ TT) ere en, are 
sf Ca a 

a3 Sg IneRRUPT SHADOW RW READIWRITE 


E33) RESERVED R READ-ONLY 


Figure 8-6 DAU Programming Model 
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Table 8-3 dauc Register 


Bit 7 6 5 Cee ee A) 
Ce | a COND ROUND’ CONV 


pi-law input conversion. 

A-law input conversion. 

Unsigned linear byte input conversion. 
u-law output conversion. 

A-law output conversion. 


Unsigned linear byte output conversion. 

Round-to-nearest on float-to-integer conversions. 

Truncate towards -< on float-to-integer conversion. 

Truncate towards 0 on float-to-integer conversion. 

DA instructions ifalt, ifaeq, and ifagt have unconditional Z-writes. 
DA instructions ifalt, ifaeg, and ifagt have conditional Z-writes. 
Must be set to zero; Read as zero. 








Table 8-4 ctr Register 


Bit 7 6 5 4 3 2 1 0 
FRG te | nO sc OD fun Not | N= Neen NC [NT [TNO] 








Current DAU N Flag 

DAU N Flag computed for Instruction '-1" 
(previous DA instruction that set flags) 
DAU N Flag computed for Instruction '-2' 
DAU N Flag computed for Instructiot 
DAU N Flag computed for Instructio 
DAU N Flag computed for Instruction '-5' 
Reserved - read as zero 





8.2.2 DAU Functional Description 


The DAU is composed of two sections, the DAU control section and the DAU data path. The DAU control section 
is responsible for decoding the instructions and pipelining the control signals through the four stages of execution. 
The DAU data path is responsible for performing the floating-point arithmetic and data type conversions. Figure 
8-7 illustrates the hardware architecture of the DAU. 


The DAU supports two floating-point formats, single precision (32-bit) and extended single precision (40-bit). 
Extended single precision provides 8 additional mantissa guard bits. The four 40-bit accumulators, a0—a3, 
reduce round-off problems to ensure 24-bit precision. Post-normalization logic transparently shifts binary points 
and adjusts exponents to prevent inaccurate rounding of bits when the floating-point numbers are added or 
multiplied, eliminating concerns like scaling and quantization error. All normalization is done automatically, so the 
result in the accumulator is fully normalized. 

The DAU contains both 32-bit and 40-bit registers and buses to support two floating-point formats, 32-bit and 40- 
bit extended precision floating-point. The 40-bit format provides 8 additional mantissa guard bits used by the DAU 
in accumulate operations. Figure 8-8 shows the two floating-point formats. 

When a 40-bit bus connects to a 32-bit register or multiplexer, the guard bits are excluded (truncated). For 


example, when writing a 40-bit accumulated result to memory, the result is first truncated to 32-bits. When a 
32-bit register drives a 40-bit bus, the guard bits (bits 15—8) are zeroed on the 40-bit bus. 
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DATABUS 32 


DAU 
DATA PATH 


(32) 


FLOATING- 
POINT 


MULTIPLIER TYPE CONV 


CONTROL 


FLOATING- 
POINT 
ADDER 


=o a 


LEGEND: 


IR Instruction Register Multiplier input register 

IR1 IR for instr. cycle 1 Multiplier input register 

IR2 IR for instr. cycle 2 Product register 

IR3 IR for instr. cycle 3 Special function register 

IR4 IR for instr. cycle 4 dauc DAU Control Register 

DU1,DU2 Input operand delay ctr Clip-test Register 
registers a0—a3  Accumulators O—3 


Registers in upper case are not user accessible. [2] Shadowed during interrupts. 





Figure 8-7 Block Diagram of the DAU 
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B. 40-Bit Floating-Point Format - 
Figure 8-8 DAU Floating-Point Formats 


Each DAU multiply-accumulate instruction involves three floating-point operands. Two of these operands are 
multiplied together, and the result is added to the third. The result of the addition is stored in an accumulator. 
The value in this accumulator may be an intermediate result used in later multiply-accumulate operations, or it 
may be a final result to be stored in memory. 


The DAU multiplier inputs can originate from memory or an accumulator. Constant values (e.g. 1.0, 0.0) may also 
be selected as multiplier inputs. Multiplier inputs are 32-bit floating-point numbers with a 24-bit mantissa and an 
8-bit exponent. If an accumulator is input to the multiplier, the accumulator guard bits are first truncated. The 
multiplier always provides one input to the adder. The other adder input can originate from memory, or an 
accumulator. Constant values (e.g. 1.0,0.0) may also be selected as an adder input. This input is either a 32- or 
40-bit operand, depending on its source. 40-bit operands are provided by the four accumulators, a2—a3. See 
8.2.3 Floating-Point Arithmetic Precision for more details about the precision of the operands in the DAU. 
Although the multiply-accumulate structure is rigid, there is flexibility in the way operands are loaded into the three 
inputs. 

Single instruction, data type conversions are done in hardware in the DAU reducing overhead required to do 
these conversions. The DAU performs data type conversions between the DSP32 32-bit floating-point format and 
the folowing: 


—IEEE P754 standard 32-bit floating-point 
—16- and 32-bit integer 

—8-bit u-law and A-law formats 

—8-bit unsigned integer 


The DAU also provides an instruction to convert a 32-bit floating-point operand to a 3-bit seed value used for 
reciprocal approximation in division operations. 
8.2.3 Floating-Point Arithmetic Precision 


Each DAU multiply-accumulate instruction (of the form A = S + Y * X) involves three floating-point operands. Two 
of these operands (X and Y) are multiplied together, and the result (P) is added to the third (S). The result of the 
addition is stored in one of four accumulators (a0—a3). The contents of this accumulator may be an intermediate 
result used in later multiply-accumulate operations, or a final result to be stored in memory. 
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Figure 8-9 illustrates the precision of the floating-point operands and results involved in DAU multiply-accumulate 
operations. (Because precision is a function of the mantissa length, discussion of the exponent is omitted here.) 
An implied binary point is included in the figure. Bits to the right of the binary point constitute the fractional part of 
the number, while those to the left constitute the integer part. 


Recall that the magnitude of the mantissa (M), in the multiplier inputs (X and Y), adder input (S) and adder result 
(a0—a3) is always normalized such that -2 <M<-1 or1<M<2. Thus, the leading 1 (positive mantissa) or 
leading 0 (negative mantissa) does not appear explicitly in the floating-point word, and is called an implicit bit. 


The result of a multiplication (P) could have a denormalized mantissa (e.g. in cases such as —2 *-2 = 4 and 
-1*1=-1), and thus three integer bits are retained in P. Normalization is done automatically in the adder, so the 
accumulated result is fully normalized. The adder inputs (P and S) and result (a0—a3) contain eight mantissa 
guard bits in addition to the standard 24 fractional bits in the mantissas. 


If the result of a multiply-accumulate operation is written to memory, its guard bits are truncated. If an 
accumulator is input to the multiplier, its guard bits are also truncated. There is an option to first invoke the round 
instruction on the accumulator containing the result. The guard bits are useful when the accumulator is input to 
the adder. Here, the full width accumulator, including guard bits, is used. 


X Operand 
(Multiplier Input) 


(=| {=] 


Y Operand 
(Multiplier Input) 





Truncate 
14LSBs 


P Product 

epee er 
forade 5 

(Adder Input) 








guard 
(Adder Result) 
k—_| 
guard 


1 denotes implicit bit 


Figure 8-9 Precision of DAU Operations (mantissas only) 
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8.2.4 Data Type Conversions 


Several data type conversions that have traditionally added considerable software overhead are handled by the 
DSP3210 in hardware. The DAU provides single-cycle instructions that perform data-type conversion between 
the DSP32 internal 32-bit floating-point format and the IEEE P754 standard 32-bit floating-point format, the 8-bit 
companded p- and A-law formats, the 8-bit unsigned linear format, and the 16- and 32-bit integer formats. The 
following sections describe these operations in more detail. 


8.2.4.1 IEEE Standard Floating-Point 


Although the DSP3210 has its own internal floating-point format, the DSP3210 is compatible with the IEEE P754 
standard 32-bit floating-point format. The device contains on-chip hardware that supports format conversion to 
and from the IEEE format in a single instruction cycle. The single-precision IEEE standard floating-point format 
has a sign-magnitude mantissa and an 8-bit biased exponent. Figure 8-10 shows this format. 


The floating-point format used by the DSP3210 is referred to as the DSP32 format. Itis the format used by the 
DSP32, DSP32C, and the DSP3210. Conversions between the IEEE standard and DSP32 formats adhere to the 
following rules: 


—If the exponent of the IEEE number is 255, the number is treated as a positive or negative infinity according to 
its sign. If positive, the converted DSP32 number is the largest positive representable number in the DSP32 
format (Ox7fffffff). If negative, the converted DSP32 number is the most negative representable in the DSP32 
format (0x800000ff). 


—DSP32 to IEEE conversions never result in NaN (not a number). However, if the IEEE format input operand in 
an IEEE to DSP32 conversion operation is NaN, the associated NaN error bit in the error source register is set. 


—If the exponent of an IEEE number is zero, the converted DSP32 number is 0x00000000, or positive zero. The 
IEEE standard includes denormalized numbers and a negative zero, i.e., the exponent of the IEEE is zero, but 
the sign and/or mantissa are not zero. Again, in both cases, the converted DSP32 number is 0x00000000 or 
positive zero. The DSP32 format does not recognize a negative zero. Denormalized IEEE numbers are not 
supported. 





where (z=IEEE number) 
—lIf e=255 and f40 then z is NaN (not a number) regardless of s. 
—If e=255 and f=0 then z= (-1)2127 
—If 0<e<255, then z= (-1)8 2° - 127 (1) 
—lIf e=0 and f#0 then z=(-1)s 2-126 (0.f) 
—lf e=0 and f=0 then z=(—1)*0 (zero). 





Figure 8-10 IEEE Standard 32-bit Floating-Point Format 
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8.2.4.2 Companded 
On-chip hardware supports the conversion to and from two 8-bit compressed PCM data formats: y-law and A-law. 
jelaw 


The format of an 8-bit, j1-law word is as follows: 





mo mi m2 m3 nO ni n2 s 
This p-law word represents the following number: 
Y = (-1)8 * ((16.5+M) * 2N — 16.5] 


where: 
M=m3m2m1 m0 (Mis 0 to 15) 
N=n2n1 n0 (Nis 0 to 7) 
* = multiplication 
— = bar (complement) 


A-law 


The format of an 8-bit, A-law word is as follows: 





moO mi m2 m3 nO ni n2 5 
This A-law word represents the following number: 


Y = (+1) * ((16.5+M) * 24) Ne 1,2) 0057 
Y =(-1)8* (0.5 +M)*2 N=0 


where: 
M=m3 m2 m1 m0 (Mis 0 to 15) 
N =n2n1 n0 (Nis 0 to 7) 
multiplication 
— = bar (complement) 






8.2.4.3 8-bit Unsigned Integer 


The DAU can perform conversions between 8-bit unsigned integers and DSP3210 floating-point values in a single 
instruction cycle. The 8-bit integer values are assumed to be in the range 0 to 255 when converting to floating- 
point. Floating-point values are converted to an 8-bit unsigned integer. If the floating-point operand exceeds the 
range of allowable 8-bit unsigned integers (0,255), a saturated 8-bit integer is returned (OxFF for the most positive 
or 0x00 for the least positive). Negative floating-point numbers are converted to 0x00. Based on the dauc 
register, floating-point numbers are converted to 8-bit unsigned numbers by rounding or truncation (See 8.2.5 
Rounding Modes). If the 8-bit integer result is written to memory, it is loaded into an 8-bit memory location. 


8.2.4.4 16-bit Integer 


The DAU supports conversion from a floating-point operand to a 16-bit 2's complement integer. If the 16-bit 
integer result is written to memory, it is loaded into a 16-bit memory location. If the floating-point operand 
exceeds the range of allowable 16-bit integers (-2'5, 215 _ 1), a saturated 16-bit integer is returned (0x8000 or 
Ox7fff, respectively). Based on the dauc register, floating-point numbers are converted to 16-bit linear by rounding 
or truncation (See 8.2.5 Rounding Modes). Similarly, the DAU supports conversion from a 16-bit integer to the 
equivalent 32-bit floating-point representation. 
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8.2.4.5 32-bit Integer 


The DAU supports conversion from a floating-point operand to a 32-bit 2's complement integer. If the 32-bit 
integer result is written to memory, it is loaded into a 32-bit memory location. If the floating-point operand 
exceeds the range of allowable 32-bit integers (-231, 231 — 1), a saturated 32-bit integer is returned (0x80000000 
or Ox7fffffff, respectively). Based on the dauc register, floating-point numbers are converted to 32-bit linear by 
rounding or truncation (See 8.2.5 Rounding Modes). Similarly, the DAU supports conversion from a 32-bit integer 
to the equivalent 32-bit floating-point representation. 


8.2.4.6 Reciprocal Seed 


In the DSP3210, a divide operation can be performed by calculating the reciprocal of the divisor, and multiplying 
this value by the dividend. An instruction is provided to convert, in hardware, a 32-bit floating-point number to a 
32-bit seed value as a first approximation to the reciprocal of the number. This one-cycle instruction produces a 
seed with a mantissa accurate to 3 bits. 


8.2.5 Rounding Modes 


On conversions from floating-point to 32-bit or 16-bit signed integers, or to 8-bit unsigned integers, the dauc 
register (see Table 8-3) controls whether the integer result is rounded to nearest, truncated towards -c, or 
truncated towards zero. Figure 8-11 graphically displays the differences between the three modes. 


With round-to-nearest, the resulting integer is incremented if there is a one in the most significant fraction bit of 
the denormalized floating-point number. Note that this rounds 0.5, 1.5, 2.5,... to 1, 2, 3,..., respectively and —0.5, 
-1.5,-2.5,... to 0,-1, -2,..., respectively. 


With truncate towards -<, the fraction bits of the denormalized floating-point number are discarded. 


With truncate towards zero, the fraction bits of the Uenormalized floating-point number are treated differently 
based on whether the number is positive or negative. For positive numbers, the fraction bits are discarded. For 
negative numbers, the integer is incremented by one if there were any 1s in the fraction bits. 


On conversion of 8-bit unsigned integers, there is no difference between truncate towards zero and truncate 
towards -ce because all negative numbers are saturated to zero. 


Note that the round() instruction rounds a 40-bit floating-point number to a 32-bit floating-point number, and itis 
always performed as round-to-nearest, regardless of the dauc setting. 
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-2.0 -1.5 -1.0 05 0.0 05 1.0 1.5 2.0 FLOATING-POINT 
ROUND-TO-NEAREST 
-2 -1 ° 1 2 INTEGER 
-2.0 -1.5 -1.0 05 0.0 0.5 1.0 15 2.0 FLOATING-POINT 
TRUNCATE TOWARDS -2 
-2 = to) 1 2 INTEGER 
-2.0 -1.5 -1.0 05 0.0 05 1.0 1.5 2.0 FLOATING-POINT 
TRUNCATE TOWARDS 0 


-2 4 0) 1 2 INTEGER 
Figure 8-11 Floating-Point to Integer Mapping for All Rounding Modes 


8.2.6 DAU Internal Operation 

The following section presents a description of a single instruction as it passes through the DAU pipeline. The 
DAU supports four multiply-accumulate instruction formats, providing flexibility in choosing operand and storing 
results. To simplify this discussion, one format will be discussed. The DSP3210 executes the multiply- 
accumulate instruction 


aN =aM+Y*X 


in three stages (an optional fourth stage to perform a write to memory is discussed later); fetch of X and Y, 
multiplication of X and Y, and accumulation of the Y * X product with the contents of an accumulator, aM. This 
instruction executes as follows: 


step 1) X,Y fetch 
step 2) multiply Y * X 
step 3) accumulate product with aM 
If several multiply-accumulate instruction are executed one after the other, the DSP3210 automatically pipelines 


the instructions so that one instruction is completed every instruction cycle. This is shown in the following block of 
instructions where X1-X4 and Y1-Y4 are subscripted to make it easier to follow the data flow in the DAU. 


1) aN = aM+ Y1* X1 
2) aN = aM + Y2* X2 
3) aN = aM + Y3 * X3 
4) aN = aM + Y4* X4 
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The execution of these instructions is shown below: 


1) X,Y fetch 
2) multiply X,Y fetch2 
3) accumulate1 multiply2 X,Y fetchs 
4) accumulate2 multiplys X,Y fetcha 
5) accumulates multiplys 

6) accumulates 


If an accumulated result is intended to be used as an input to the multipier, the programmer must be aware of 
when an accumulate is finished. For example, the accumulation of instruction 1 will be available as a multiplier 
input for instruction 4. This latency is discussed further in Section 4.4.2 Latency Effects. An accumulated result 
may always be used as an accumulator input in the next instruction. 


The DSP3210 pipeline may extended to four stages, allowing the contents of an accumulator, multiplier input, or 
adder input to be written to memory without requiring an additional instruction. This a a key feature when 
performing such tasks as windowing, adaptive filtering, matrix operations, etc. To illustrate the additional stage, 
another column for the write is added to the previous pipeline diagram. 


1) X,Y fetch1 
2) multiply1 X,Y fetcha 
3) accumulate multiply2 X,Y fetchs 
4) writet accumulate2 multiplys X,Y fetcha 
5) write2 accumulates multiplys 

6) writes accumulates 


The syntax of such an instruction is 
Z=aN=aM+Y*X 
Note that the memory location written by instruction 1 will be available as multiplier input in instruction 5. 


The internal pipeline timing of the DAU for a single instruction of the form Z = aN = aM + Y * Xis shown in Figure 
8-12. An instruction requires five instruction cycles (lo—I4) to be decoded and executed in the DAU. The 
instruction (I) appears on the data bus in state 3 preceding lo. The X and Y registers are loaded in states 1 and 2 
of It, respectively. The contents of these registers are multiplied during l2 and the product is loaded into the 
product register, P, in state 3 of lz. The adder input register, S, is also loaded in state 3 of l2. The contents of the 
P and S registers are added during I3 and the result is loaded into an accumulator in state 3 of Is. This same 
result is then placed on the data bus (which is routed to a destination in memory) in state 0 of I4, if a Z-field is 
specified in the instruction. The Z-field is optional. 
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The DAU supports two multiply-accumulate instruction formats called tap-update instructions. In one format, 
aN = aM+ (Z=Y)*X 


instead of the result of the multiply-accumulate operation being written to memory, the Y operand.is written. It 
must be written in the same cycle that the multiply-accumulate result would have been written (state 0 of I4). 
Because the Y operand appears on the data bus in state 2 of I1, it must be latched and held for three instruction 
cycles before being written to memory. This three instruction cycle delay is performed in the tap-update delay 
block in Figure 8-7. The other tap-update instruction has the form: 


aN = (Z= Y) +X. 


The four-stage pipeline in the DAU contributes to the high throughput of the DSP3210. 





For instructions of the form: Z = aN = aM + Y * X, 
where X,Y= data reference (memory, a0—a3) 


aM= a0-a3 
aN= a0-a3 
Z= data write (memory) 
P= product 
S= adder input 


Figure 8-12 DAU Internal Pipeline Timing 
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9. PERIPHERAL FUNCTIONAL UNITS 


The DSP3210 provides four on-chip peripheral units; serial /O (SIO), DMA controller (DMAC), 32-bit timer 
(TIMER), and bit /O (BIO). Access to peripheral functional units is memory-mapped. Detailed descriptions of 
these facilities are contained in this section. 


9.1 Serial /O 


The serial /O (SIO) section provides synchronous, double-buffered serial ports for concurrent input and output of 
digital data. Inputs and outputs can be handled under program control, DMA mode, or interrupt mode. The serial 
VO can interface to a TDM (Time Division Multiplexed) line, serial codecs, or other DSPs (DSP32, DSP32C, 
DSP3210, DSP16, DSP16A) with few, if any, additional components. 

The SIO is divided into three functional units, the serial input port (SIP), the serial output port (SOP), and the SIO 
clock generator (see Figure 9-1). There is a register, ioc, that allows the programmer to configure these functional 
units for various modes of operation. Registers that are accessible to the programmer are denoted by names in 
lower-case. 

The description of the SIO is divided into five sections: SIO programming model, pin descriptions, serial input port, 
serial output port, and on-chip SIO clock generator. This chapter also includes a section on special purpose SIO 
flags and error condition reporting. The remainder of the SIO section describes programmed, DMA, and interrupt 
modes of handling serial I/O activity. 


9.1.1 SIO Programming Model 
The programming model for the SIO consists of three MMIO registers: ibuf (MMIO[0x0404)), obuf 
(MMIO[0x0408)}), and ioc (MMIO[0x040C)). “The MMIO address corresponds to the 16 LSB's of the 32-bit 
address. The address is given for both big and little endian byte orderings. The byte order is specified by pew(8]. 
The upper 16-bits are a function of the memory mode configured in the pew register. The register programming 
model for the SIO is pictured below: 

Register Address Address Register Bits 


Name Big Endian _Little Endian 
31 24 23 16 15 87 0 


* Shows 32-bit read address. 
+ Shows 32-bit write address. 
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The ioc (input/output control) register is a 32-bit read-write register that configures the serial input port, the serial 
output port, and the on-chip SIO clock generator. The position, mnemonic, encoding, and the definition of each 
bit field of the ioc register are described in Table 9-1. The serial input buffer, ibuf, may be read as an 8-bit, 16-bit, 
or 32-bit register. Referencing ibuf is a function of bit mode (least significant bit firsvmost significant bit first), byte 
ordering, and data size. When ibuf is used as an 8-bit or 16-bit register, valid data is read as follows: 


LE 8 31—24 0x0407 
LE 8 7—0 0x0404 
LE 16 31—16 0x0406 
LE 16 15—0 0x0404 


Referencing obuf is a function of byte ordering and data size. When obuf is used an an 8-bit or 16-bit register, 
valid data is written as follows: 


Mode Byte Ordering Data Size Register Bits 
MSB/MSB BE 8 15—0 0x040a 
is 8 15—0 0x0408 
LSB/MSB BE 8 Ti) 0x040b 
LE 8 7—0 0x0408 
The serial output buffer, obuf, may be written as an 8-bit, 16-bit, or 32-bit register. In the remainder of this 
chapter, the mnemonics ibuf, obuf, and ioc will be used exclusively and not the MMIO address. These registers 
may be read or written using any instruction supporting memory references. The DMA controller uses the ibuf 


and obuf as source and destination registers, respectively, for DMA transfers. Reset clears the ioc register to all 
zeros, but has no effect on the ibuf and obuf registers. 










Mode 





LSB 
LSB 
MSB 
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SERIAL INPUT PORT 




















Dl 
te IBF FLAG 
ILD 
ibuf (8/16/32) 

MMIO BUS 

| DATA BUS 
po. FI [ _obut ie, 16,32) | 

OBE FLAG 


FRAME BOUNDARY FLAG > TO CAU 
SY FLAG 


LOSS OF SANITY ERROR TO 
ERROR 
LOSS OF SY ERROR HANDLER 















Legend: 
ISR Input Shift Register OSR Output Shift Register 
ibuf Input Buffer obuf Output Buffer 


ioc Input/Output Control MMIO Memory-Mapped I/O 








Figure 9-1 SIO Block Diagram 
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Table 9-1 ioc Register 
















Bit | 31-24 | 23-20 17 | 16] 15 | 14 en|ove [ke | 4) se. | tho 
Fed [Res [| ICN | 08z|1Sz] Our] IN [SAN] tic [80 [oueNT Aot | aoc Tien | alt [ Atc | SLENT Bc | ASY | 


If0, SY is external. If 1, SY is internal. When generated internally, SY = {ICK, OCK}/{256, 
512, 1024}, based on ioc[1] and ioc[3,2]. 

If 0, ICK is used to derive the internal load and SY signals. If 1, OCK is used to derive the 
internal load and SY signals. 

These bits select the frequency ratio of the on-chip load signal to the on-chip SY signal. 


ratio = 32 

ratio = 8 

ratio = 16 

ratio = 32 
If 0, ICK is external. If 1, ICK is generated internally with a frequency specified by ioc[23—20]. 
If 0, ILD is external. If 1, ILD is generated internally with a frequency of {ICK,OCK}/32, based 
on ioc[1]. 
These bits specify the length of the serial input data. 


input length is 32 bits (prior to ILD) 
input length is 8 bits (after ILD) 
input length is 16 bits (after ILD) 
input length is 32-bits (after ILD) 
If 0, OCK is external. If 1, OCK is internally generated with a frequency specified by 
ioc[23—20]. ‘ 
If 0, OLD is external. If 1, OLD is internally generated: (1) with a frequency of {ICK,OCK}/32, 
based on ioc[1] if ioc[13] = 0, or (2) in a burst-mode if ioc[13] = 1. 
These bits specify the length of serial output data. 
iooft2] [ioc[it} T iocftoyf 
reserved 
output length is 8 bits 
output length is 16 bits 
output length is 32 bits 
output length is 24 bits 
reserved 
x reserved 
If 0, the internal OLD is {ICK,OCK}/32, based on ioc[1]. If 1, the internal OLD is burst-mode, 
generated as a function of the status of the obuf and OSR. 
If 0, the ICK is true (not inverted). If 1, the ICK is inverted. 
If 0, clear sanity bit. If 1, set sanity bit. 
If 0, the LSB is received first during serial inputs. If 1, the MSB is received first during serial 
inputs. 
If 0, the LSB is transmitted first during serial outputs. If 1, the MSB is transmitted first during 
serial outputs (can't be used with 24 bit output length). 
If 0, the input DMA transfer size is 32-bits. If 1, the input DMA size is specified by ILEN. 
If 0, the output DMA transfer size is 32-bits. If 1, the output DMA size is specified by OLEN. 
These bits specify the divide-down rate of the internal clock generator. The internal clock 
generated has an additional divide-down by four, i.e., if bits 23—20 represent N, the clock rate 
is N*4. If ICN=0, the active clock generator is disabled. 
Reserved; Read as zero. 
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Table 9-2 Pin Descriptions 
[ Pin [Name 


Ba Data Input 


Input Clock 
Input Load 


Data Output 


| 50 _| 
a Output Clock 










Input for serial digital data. Data is sampled with the rising edge of ICK if IIC = 0, 
or the falling edge if IC = 1. 
Clock for serial digital input data. Serial input port activity is based on ICK. 





















Clock for serial digital output data. Serial output port activity is based on the 
rising edge of OCK. Depending on the ioc setting, this signal may be either an 
it 












A high-to-low transition on this pin causes the loading of the output shift register 
from the output buffer. Depending on the ioc setting, the signal can be either an 
input (external mode) or an output (internal mode). 

Provides frame synchronization either to the DSP3210 from peripheral 
components (external mode) or from the DSP3210 to peripheral components 
(internal mode), depending on the ioc setting. The logical state of this pin may be 
tested by CA instructions (see Section 4.3 CA Instructions). 







9.1.2 Pin Descriptions 


The SIO has seven pins. Three pins are associated with the SIP, three with the SOP, and one with the SIO clock 
generator. Two pins in each of the input and output sections (corresponding to the bit clock and load signal) and 
the pin for the SIO clock generator (corresponding to the frame signal) are bidirectional. These signals are 
denoted as internal and are an output if they are generated by the DSP3210. These signals are denoted as 
external and are an input if they are supplied from an outside source. Table 9-2 is a description of the seven pins 
associated with the serial I/O. For each pin, the pin name, full name, and a description of its function are given. 


9.1.3 Serial Input Port 


The serial input port (SIP) is used by the DSP3210 to receive serial data from external devices or data streams. 
The serial input port consists of the input buffer (ibuf), the input shift register (ISR), the input load shift register 
(ILSR), and associated logic, as shown in Figure 9-2. The SIP supports a synchronous three wire protocol: clock 
(ICK), load (ILD), and data (DI). An internal flag, input buffer full (IBF), is also available to indicate the status of 
the ibuf. 


The edge of the clock used to clock data in the SIP is configurable by ioc[14], IIC. When lIC=0, the serial input 
port is sensitive to the rising edge of ICK. When IIC=1, ICK is inverted on-chip making the port sensitive to the 
falling edge of ICK. Serial data in (Dl) and the load signal (ILD) are clocked into the ISR and ILSR, respectively. 
The load signal is delayed in the ILSR at the bit clock rate and is used to generate a signal to load the ibuf from 
the ISR. The number of clocks that the load signal is delayed is a function of the input length (ILEN) specified in 
the ioc register. When the IBUF is loaded, the input buffer full (IBF) flag is set high. When the IBUF is read by a 
DSP3210 program, interrupt routine, or DMA facility, the IBF flag is set low. Together, the ISR and IBUF form a 
double buffer so data can be shifted into the ISR while data is waiting to be read from the IBUF. 
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9.1.3.1 Configuring the Serial Input Port 


ICK and ILD can be supplied off-chip (external mode), from an external peripheral, or on-chip (internal mode) from 
the on-chip SIO clock generator (see 9.1.5 SIO On-chip Clock Generator). The selections of internal or external 
mode for ICK, ioc[4], and ILD, ioc[5], are totally independent of each other. The internal clock generator, as 
configured by ioc[3-0] and ioc[23—20], generates the internal bit clock, load signal, and sync signal. The internal 
bit clock rate can be set for rates between CKI/4 and CKI/60 via ioc[23—20], ICN. The internal load signal is a 
divide by 32 of either ICK or OCK, based on ioc[1]. Note that with a continuous bit clock, the internal load will 
occur every 32 periods of the bit clock. The ICK can be configured to be true or inverted by ioc[14], IC. When 
ICK is true, the SIP is sensitive to the rising edge of ICK. When ICK is inverted, the SIP is sensitive to the falling 
edge of ICK. 


The length of the input data can be specified to be 8, 16 or 32 bits via ioc[6,7], ILEN. If ILEN is set to 00, then the 
input data corresponds to the 32 bits prior to the ILD high-to-low transition. This mode can be used for data 
lengths other than 8, 16, and 32 bits by masking off the undesired bits. For the 01,10, and 11 cases, the input 
data corresponds to the 8, 16, or 32 bits after the ILD high-to-low transition, respectively. These two protocols for 
serial input are shown in Figure 9-3 (IC = 0, positive edge-triggered, is shown). The order of the serial input data 
bits can be specified as least significant bit first, LSB, or most significant bit first, MSB, via ioc[16], IN. IN is used 
to control the direction of the bidirectional shift register ISR and select which bytes of the IBUF are put on the 
internal Data or MMIO Bus when the input buffer is read. Input data resides in the ibuf register bits as a function 
of LSB or MSB mode and the size specified. Valid data in the input buffer is organized as follows: 


31 16 15 0 
LSB 32-Bit 
MSB 32-Bit MSB tsB 
LSB 16-Bit MSB fa ike nee oe 
MSB 16-Bit 
\sB git 
MSB 8-Bit ice ss ae ee LSB 


The Data Bus is used for DMA controlled transfers and the MMIO bus is used for programmed transfers. 
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INTERNAL DATA BUS 







MMIO BUS 










Key: 32 

AIL ioc[5] 

ILEN _ ioc[6-7) ibuf (8, 16, 32) nw IBF 
IN ioc[16] . 






DI 


ICK 
AIC 






INTCLK 


ILD 


INTLOAD 


ILEN 


Figure 9-2 Serial Input Port 


9.1.3.2 Serial Input Timing Specifications 


The timing requirements and characteristics for serial input transmissions are shown in Figure 9-4 (lIC = 0, not 
inverted). The specifications are described as follows: 


ICK Clock Requirements: The clock requirements are the period of the input data bit clock ICK (tICKLICKL), 
the high time of ICK (tICKHICKL), and the low time of ICK (tlICKLICKH). 


ILD Requirements to Initiate an Input Transmission: ILD is set up high before the rising edge of ICK 
(tILDHICKH), and is held high until after the rising edge of ICK (tICKHILDL). ILD is set up low before the next 
rising edge of ICK (tILDLICKH) and is held low until after that edge of ICK (tICKHILDH). Similar requirements 
must be met when IIC=1 (negative edge-triggered). 


DI Requirements: Serial input data bits 0 through n-1 must satisfy set up (tDIVICKH) and hold (tICKHDIX) 
times to be latched and shifted in the ISR on the rising edge of ICK. Similar requirements must be met when 
lIC = 1 (negative edge-triggered). 
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ILEN = 00 





INTERNAL IBF 







































IcK 
WIUVVUVVVVN WVVVVVYVYVYVVYVYVYVYVYVYVYVYVYYYYY 
Ie XXXXXXXXXXX MXXXXAXAAAAAMARAA AAA 
VVVV VVVV V 
Di XXXX be-2 XXXX bn-t A X 
INTERNAL IBF 
Figure 9-3 Serial Input Protocols (IC = 0) 
tiCKLICKL 
tiICKLICKH 
tICKHICKL 
ICK Vim(1.5V) — 
tILDHICKH tICKHILDH 
tICKHILDL 
tILDLICKH 
VIH — 
ILD 
vil — 


tDIVICKH 


tiICKHDIX 


Figure 9-4 Serial Input Timing (IIC = 0) 
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9.1.4 Serial Output Port 


The serial output port (SOP) is used to transmit serial data from the DSP3210 to external devices or data streams. 
The serial output port consists of the output buffer (obuf), the output shift register (OSR), the output load shift 
register (OLSR), and associated logic, as shown in Figure 9-5. The SOP supports a synchronous three-wire 
protocol: clock (OCK), load (OLD), and data (DO). An internal flag, output buffer empty (OBE), is available to 
indicate the status of the OBUF. 


Serial output data on DO is clocked out of the output shift register (OSR) on the rising edge of the clock OCK. 
When the obuf is loaded by the program, the interrupt routine, or the DMA facility, the output buffer empty (OBE) 
flag is set low. When the data in obuf is transferred to the OSR, the OBE flag is set high. Together, the obuf and 
OSR form a double buffer so that data can be shifted out of the OSR while data is loaded into the obuf. On each 
rising edge of OCK, the load signal (OLD) is clocked and shifted in the output load shift register (OLSR). The load 
signal is delayed by the OLSR at the bit clock rate and is used in the generation of the load/shift signal for the 
OSR and the internal burst OLD (INTLOADB). If selected by ioc[13] and ioc[9], a burst OLD signal can be 
generated by the SIO that is active only when new data has been loaded into the obuf rather than on a periodic 
basis. 











key: INTERNAL DATA BUS 
AOC _ioc(8) MMIO BUS 

AOL __ioc[9} 

OLEN  ioo[10-12} 

BO ioc[13] 

OUT _ ioc[17] 


OUT 
OBE 


obuf (8, 16, 32) 





Do OSR (32) <t— 
Ock Load/Shitt 
Vv OLSR(33) OSE 
(USED TO 
(8) GENERATE BURST 
INTERNAL OLD) 
OLD 


OLEN 
INTLOADC INTLOADB 


Figure 9-5 Serial Output Port 
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9.1.4.1 Configuring the Serial Output Port 


OCK and OLD can be supplied off-chip (external mode) from an external peripheral, or on-chip (internal mode) 
from the on-chip SIO clock generator (see Section 9.1.5 SIO On-chip Clock Generator). The selections of internal 
or external mode for OCK, ioc[8], and OLD, ioc{9], are totally independent of each other. The internal clock 
generator, as configured by ioc[3—0] and ioc[23—20], generates the internal bit clock and load signal. The 
internal bit clock rate can be set for rates between CKI/4 and CKI/60 via ioc[23—20], ICN. The internal load 
signal can be configured in two ways: (1) continuous or (2) burst. The continuous internal load signal is a divide 
by 32 of either ICK or OCK, based on ioc[1]. Note that with a continuous bit clock, the internal load signal will 
occur every 32 bit clocks. The burst OLD signal uses the OBE flag and the OLSR output (OSE) to generate an 
OLD that is active only when new data has been loaded into the OBUF. 


The length of output data can be specified to be 8, 16, 24, or 32 bits via ioc[12—10]. In all cases, the data 
corresponds to the bits after the OLD high-to-low transition. The order of the serial output bits can be specified as 
least significant bit first, LSB, or most significant bit first, MSB, via ioc[17], OUT. OUT is used to select which bit of 
the Internal Data Bus or MMIO Bus loaded into each bit of OBUF. If OUT is set to 0, bit 0 — obuf(0), bit 

1— obuf(1), ..., bit31— obuf(31). If OUT is set to 1, bit 0 > obuf(31), bit 1 > obuf(30), ..., bit31 —» obuf(0). 
MSB can not be used with an output data length of 24 bits. Eight-bit data is always placed in bits 7—0, 16-bit 
data is always placed in bits 15—0, and 32-bit data is placed in bits 31—0. The data bus is used for DMA 
controlled transfers and the MMIO bus is used for programmed transfers. 


The protocol for serial ouput is shown in Figure 9-6. 





INTERNAL OBE 


Figure 9-6 Serial Output Protocol 





9.1.4.2 Serial Output Timing Specifications 
A typical output transmission is shown in Figure 9-7. The timing requirements and characteristics are specified as 
follows: 


= OCK Clock Requirements: The requirements are the period of the output bit clock OCK (tOCKLOCKL), the 
high time of OCK (tOCKHOCKL), and the low time of OCK (tOCKLOCKH). 


= OLD Requirements to Initiate an Output Transmission: OLD is high before the rising edge of OCK 
(tOLDHOCKH), and is held high until after the rising edge of OCK (tOCKHOLDL). OLD is set up low before 
the next rising edge of OCK (t(OLDLOCKH) and must be held low until after that edge of OCK (tOCKHOLDH). 


= DOCharacteristics: Serial output bits 0 through n-1 appear on DO (tOCKHDOV) and are held (tOCKHDOX) 
after the rising edge of OCK. 
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Figure 9-7 Serial Output Timing 
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Figure 9-8 On-Chip Clock Generator 
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9.1.5 SIO On-Chip Clock Generator 


Figure 9-8 is a block diagram of the on-chip clock generator which consists of three configurable counters and 
associated logic. This unit generates the internal bit clock, internal load, and internal synchronization signals. 
The clock generator also has the ability to detect the frame boundary condition and two error conditions, loss of 
sync and loss of sanity. These conditions are useful in TDM applications where frames of data are being 
processed. 


9.1.5.1 Configuring the Clock Generator 


The DSP3210 system clock, CKI, can be selected to be divided by 1 to 15, based on ICN. It is further divided by 
4 to generate the internal bit clock, INTCLK. If ioc[23—20] = 0, the internal bit clock divider is disabled and 
INTCLK is set to zero. The internal bit clock is used when the internal ICK is selected for the input port or the 
internal OCK is selected for the output port. The continuous internal load clock, INTLOADC, is derived by dividing 
by 32 either the input (ICK) or the output (OCK) bit clock, as selected by ioc[1]. This signal provides a load signal 
every 32 periods of the bit clock and is used when the internal ILD is selected for the input port or the continuous 
internal OLD is selected for the output port. INTLOADC is synchronized to SY by each high-to-low transition of 
SY. The internal synchronization signal, INTSY, is derived by dividing the internal load signal by 8, 16, or 32, 
based on ioc[3-2]. 


For example, with ioc[3—0] = 5, ioc[23—20] = 6 and CKI = 49.152 MHz, the internal clock generator is configured 
for a 2.048 MHz bit clock, a 64 kHz load signal, and an 8 kHz synchronization signal. The signals generated by 
the clock generator for this case are shown in Figure 9-9a. 


An internal OLD that is generated in a burst manner can also be selected. The burst internal OLD is generated 
(one-to-zero transition) whenever the output shift register is empty and the output buffer becomes full, or when the 
output buffer is full and the output shift register becomes empty. The second case provides back-to-back serial 
output transmissions. The burst internal OLD signal, INTLOADB, is normally high and makes a single clock cycle 
low pulse to indicate the start of an output. Figure 9-9b illustrates how OLD is generated with respect to the 
output buffer status and shift register status for the second case described above. 


9.1.5.2 Clock Generator Timing Specifications 


Typical uses of the clock generator are shown in Figures 9-10a—c, for ICK configured as positive edge-triggered. 
The timing requirements and characteristics are specified as follows: 


= Bit Clock Characteristics: The period of the data bit clock ICK (tICKHICKH) and OCK (tOCKHOCKH), the 
high time of ICK (tICKHICKL) and OCK (tOCKHOCKL), and the low time of ICK (tlCKLICKH) and OCK 
(tOCKLICKH) are shown in Figure 9-10a. 


= Load and SY Characteristics: The delay of the ILD or OLD, and SY, when configured as internal mode from 
ICK (tICKHILDL, tICKHSYL) or OCK (tOCKHOLDL, tOCKHSYL) based on ioc[1] are shown in Figure 9-10b. 


= External SY Requirements for Synchronization: When an external SY is used to synchronize the internal 
clock generator, a high-to-low transition on SY is required. The SY is set up high before the rising edge of 
ICK or OCK depending on ioc[1] (tSYHICKH, tsYHOCKH), and is held high until after the rising edge 
(tICKHSYX, tOCKHSYX). SY is set up low before the next rising edge (tSYLICKH, tSYLOCKH) and held low 
after that edge (tICKHSYX, tOCKHSYX). 


= Load Characteristic with External SY: The internal load is synchronized on every high-to-low transition of 
SY. After SY goes low, ILD and OLD will be low (tSYLILDL,tSYLOLDL). 
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9.1.5.3 Loss of Sanity 


Note: Error Condition Handling for Loss of Sanity is not supported in the DSP3210. 


Loss of sanity is defined to be a high-to-low transition on SY while ioc[15], SAN, is a one. To make use of this 
feature, SAN should be set to one at the beginning of the program processing the frame of data, and cleared to 
zero before waiting for SY to be zero. If the program processing the frame takes longer that one frame interval, 
then the loss of sanity error is reported to the error exception handler. 


9.1.5.4 Loss of Sync 


Note: Error Condition Handling for Loss of Sync is not supported in the DSP3210. 


Loss of sync is defined to be the occurence of an INTSY high-to-low transition (from the on-chip SIO clock 
generator) without a corresponding external SY high-to-low transition (from the SY pin), when SY is applied 
externally (ASY = 0). Note that the INTSY is synchronized (divide down counters are set to zero) by a high-to- 
low transition on the SY pin. For the loss of sync error to be detected, ioc[0] must be 0, and ioc[1—3] must be set 
so that the INTSY is the same frequency as the SY pin. If a loss of sync is detected, an error is reported to the 
error exception handler. 


Kock Ti OOOO 
ILD,OLD 
sy 





a. Continuous Clock Generator Signals 


ock 

OLD 

es ELS (El | ES Ce Bs lS 
OBE 

(INTERNAL) 


b. Burst Mode Internal OLD 


Figure 9-9 Clock Generator Signals 
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Figure 9-10 Clock Generation Timing (IIC = 0) 


9.1.5.5 Frame Boundary Condition 


The on-chip clock generator detects the frame boundary (fo) condition which is useful in certain TDM applications. 
The fo condition is defined to be the first time the output buffer is full following a high-to-low transition on SY, 
provided that ioc[15], SAN, is equal to zero at that time. It uniquely indicates the first channel in a TDM stream. 
After the SY transition is detected while SAN is zero, the fb flag will be the inverse of OBE until it is cleared to zero 
by setting SAN to a one. The frame boundary condition can be tested using conditional branch instructions, e.g. 
if(fos)goto label (see 4.3.1 Control Arithmetic Instructions - Control Group). The frame boundary condition is 
usually used in conjunction with the loss of sanity error condition because toggling of the SAN bit is required for 
both. Figure 9-11 illustrates how the frame boundary condition is used in a TDM application. 
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Figure 9-11 Frame Boundary Condition 


9.1.6 Handling SIO Activity 


This section describes three typical ways of handling SIO activity: program controlled, interrupt controlled, and 
DMA controlled. An example is given for each, 


9.1.6.1 Program Controlled SIO 


When using the program controlled mode of SIO activity, flags pertaining to the status of the input buffer, output 
buffer, or data stream are tested by the program. When the appropriate condition becomes true, an I/O event is 
initiated by the program. There are four flags available to the programmer, ibf, obe, sy, and fo. Both true and 
false states of these flags are directly testable by an instruction of the form: 


if (IO COND) goto {rH, N, rH+N} 
For more information, see 4.3.1 Control Arithmetic Instructions - Control Group. 


For example, in order to process input data as it is received by the serial input port, the program must wait until 
the input buffer is full. This is accomplished by the following instructions: 


waitibf: if(ibe) pegoto waitibf /* conditional branch using pc relative addressing */ 
nop /* latent instruction */ 


215 


DSP3210 Information Manual 


9.1.6.2 Interrupt Controlled SIO 


When using the interrupt controlled mode of SIO activity, an interrupt routine is used to perform the I/O event. 
The DSP3210 supports individuallly maskable interrupts based on the input buffer status flag, ibf, and the output 
buffer status flag, obe, in the emr register (see Table 7-2 Exception Mask Register Encoding). emr[12] is the 
interrupt mask bit for OBE and emr{11] is the interrupt mask bit for IBF. A value of one in the interrupt mask bit 
enables the corresponding interrupt; a value of zero disables the interrupt. The priority of concurrent silo 
interrupts is such that the interrupt based on input buffer full would take priority over the interrupt based on output 
buffer empty. 


Listing 9-1 shows an example of using interupts to receive 16-bit data by the serial input port and place it ina 
FIFO maintained in memory, while 32-bit output data from memory is retrieved and loaded into the output buffer. 





Listing 9-1 SIO Interupt Example 


#define emr ior8 
#define ibuf *0x0404 
#define obuf *0x0408 


122 = etable /* set evtp */ 

120 = ibuffer 

19 = ibuffer /* register used to store start of buffer */ 
r21 = obox 


5 = (short) emr 
15 = (short) r5 | 0x1800 
emrs (short) r5 


/* enable interrupts on IBF and OBE */ 


main: /* program body */ 
pegoto main 
ibuffer: N‘int /" FIFO storage in memory */ 
obox: float 
etable: 22*nop /* First eleven entries of evt */ 
pegoto ibfisr /* branch to IBF interrupt service routine */ 
nop 
pegoto obeisr /* branch to IBF interrupt service routine */ 
nop 
8*nop /* Remainder of evt */ 
ibfisr: t= (short) ibuf 
nop 
*r20++ = (short) r1 /* write ibuf to memory */ 
120 - 121 /* compare 120 to end of buffer */ 
if (mi) r20 = 119 /* reload starting address if at end */ 
ireturn 
obeisr: m=*r21 
nop 
obuf=r1 7" load obuf from rt */ 
ireturn 
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9.1.6.3 DMA Controlled SIO 


The DSP3210 includes a DMA option for transferring data between the ibuf, obuf, and memory without program 
intervention. The serial DMA option is particularly useful for multichannel applications in which input and output 
buffers for an entire frame of samples are established in memory. 


The DSP3210 contains a DMA controller block to perform the necessary address generation and control for the 
DMA process (see Section 9.2 DMA Controller). The DMA controller consists of two channels, one for input DMA 
and one for output DMA. The registers to configure and enable DMA activity are MMIO registers and are 
described fully in Section 9.2. All serial /O DMA activity is based on either the IBF flag or the OBE flag. Based 
on one or both of these flags, an input DMA will occur, an output DMA will occur, or both an input and an output 
DMA will occur. The last type will perform both operations, input DMA and then output DMA, in one DMA cycle, 
where a DMA cycle is equal to one instruction cycle. 


To buffer a frame of serial input data in DSP3210 memory, the following registers need to be setup: 


icnt transfer count for input DMA 
idp input dma pointer 
dmac bits 4-7 define the mode of input DMA 


For a buffer of 128, 16-bit samples, the icnt register should be configured for 0x00fe0000 (one less than the 
number of transfers desired times the data size in bytes). The idp should be set for the base address of the buffer 
which in this case must be aligned for 16-bit data (bit 0 must be 0). After the icnt and idp registers have been 
initialized, the dmac is loaded with the desired configuration. Program Listing 9-2 shows the code necessary to 
initialize the input channel of the DMA controller and enable input DMA. An interrupt can be taken when the input 
DMA transfer is complete by unmasking the IFRM bit of the emr register and supplying an appropriate interrupt 
routine. 


Listing 9-2 Input DMA Example ; 
#define icnt *0x428 


#define idp *0x420 
#define dmac *0x430 


r= Oxfe 7 icnt for 128, 16-bit samples */ 
icnt=r1 
M=ibuffer /* starting address of input buffer */ 
idp=r1 
11=0x30 /* enable input DMA, and stop when transfer count is reached */ 
dmac=r1 
ibuffer: 128*int /* input DMA buffer */ 
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9.2 DMA Controller (DMAC) 


The DMA controller is a two channel Direct Memory Access (DMA) control unit. It is used in conjunction with the 
Serial I/O to buffer input and output data streams in DSP3210 memory, internal or external. The DMA transfers 
occur between memory and the SIO ports without processor intervention, i.e., using cycle-stealing where the 
DSP3210 waits during a DMA transfer (see Section 7.3 Direct Memory Access). 


9.2.1 DMAC Programming Model 


The register programming model for the DMA controller consists of five MMIO registers. The idp and the odp are 
the input DMA and the output DMA pointer registers, respectively. Each is 32-bits and therefore can point to any 
location in the DSP3210 address space. When read, the idp and odp return the value of the current pointer 
register. The icnt and ocnt registers define the transfer count for input and output DMA frames, respectively. A ~ 
DMA frame is considered to be the number of DMA transfers to be performed. The transfer count is the number 
of bytes to be transferred. The transfer count is specified in bytes and is set to one transfer less than the 
frame size; e.g., for an output frame size of 64 and a transfer size of 16-bits the ocnt register is set for 126 or 
(64 — 1) * 2. When read, the icnt and ocnt return the value of the current transfer count register. The dmac 
register is used to configure the operating modes of both DMA controllers. The dmac register is set to zero on 
reset. The encoding and functional definition of each bit field of the dmac register are described in Table 9-3. 
The address is given for both big and little endian byte orderings. The byte order is specified by pcw[8]. The 
register programming model for the DMAC is pictured below: 


Register Address Address Register Bits 
Name Big Endian __Litttle Endian 
31 16 15 87 0 


idp oxo420 020 [ee eI NY 
odp oxo429 00) LSS ee aR 


ient Ox042a 0x0428 SS ae 1 
ocnt 0x042e 0x042C er 
dmac 0x0433 0x0430 CJ] Rw f 


9.2.2 DMAC Functional Description 


Each DMA controller consists of a base address register, a transfer count register, an adder, a counter, and a 
comparator (see Figure 9-12). A DMA channel can be configured to operate in one of two primary modes: stop 
on transfer complete mode or reload mode. The stop on transfer complete mode is used to transfer a single 
frame of data, and the reload mode is normally used when transferring continuous frames of data. In the reload 
mode, the base address register and the transfer count register operate in a double buffered manner. The 
register currently being used is called the current register, and the register to be used next is called the next 
register. 


In the stop on transfer complete mode (ORS = 0 or IRS = 0), when the processor loads the idp, odp, icnt, or ocnt 
registers, both the next register and the current register are written, and the transfer counter is set to zero. The 
address of the current location being pointed to is generated by adding the counter value to the current DMA 
pointer register. The comparator is used to compare the counter output to the current transfer count. When the 
counter equals the current transfer count the enable/disable bit (ODE or IDE) is set to zero, and the counter is set 
to zero. 
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Figure 9-12 DMAC Block Diagram 


In the reload mode (ORS = 1 or IRS = 1), when the processor loads the idp, odp, icnt, or ocnt registers, only the 
next register is affected. When the counter equals the current transfer count, the current pointer register and 
current transfer count registers are loaded from the next pointer register and next transfer count registers, 
respectively, and the transfer counter is set to zero. The enable/disable bit is unaffected. When the first frame is 
being set up, the dmac register should be configured such that DMA is disabled and the channel is in the stop on 
transfer complete mode. 


A mode is provided so that DMA cycles are requested only when the DSP3210 has ownership of its bus interface. 
This is useful when the DSP3210 is buffering data to or from a shared external memory, because it guarantees 
that the DMA cycle will not incur the overhead of waiting to gain ownership of the bus. This mode is enabled with 
the ORB for output DMA and IRB for input DMA. It operates by requesting the bus interface when a DMA 
operation has been requested, but not performing the DMA cycle until the bus has been granted. 


Note: When a DMA controller is enabled, the mode of operation, as specified by the dmac register, and the size 
of the transfer, as specified by the ioc register, should not be changed. 
9.2.3 Serial Output DMA 


The operations performed for a serial output DMA transfer are: 


= write the contents of the memory location pointed to by odp plus the counter value to the output buffer 
= increment the counter by 1,2,or 4 to obtain the next offset. 


The transfer size and increment amount are determined by the ioc register settings for the OSZ bit and the OLEN 
field (see Table 9-1 ioc Register). If OSZ = 1 and OLEN selects 8-bit input data length, the transfer size is byte 
and the counter is incremented by one. If OSZ=1 and OLEN selects 16-bit input data length, the transfer size is 
short and the counter is incremented by 2. If OSZ = 0 or OSZ = 1 and OLEN selects 32-bits (OLEN = 011 or 
100), the transfer size is long and the counter is incremented by 4. 
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9.2.4 Serial Input DMA 


The operations performed for a serial input DMA transfer are: 


= Writes the contents of the input buffer to the memory location pointed to by idp plus the counter value 
= Increment the counter by 1,2,or 4 to obtain the next offset 


The transfer size and increment amount are determined by the ioc register settings for the ISZ bit and the ILEN 
field (see Table 9-1 ioc Register). If ISZ = 1 and ILEN selects 8-bit input data length, the transfer size is byte and 
the counter is incremented by 1. If ISZ = 1 and ILEN selects 16-bit input data length, the transfer size is short and 
the counter is incremented by 2. If ISZ = 0 or ISZ =1 and ILEN selects 32-bits (ILEN = 00 or 11), the transfer size 
is long and the counter is incremented by 4. 


Table 9-3 DMAC Register Encoding 
Bit Hi: 6 5 4 3 2 1 0 
Field [_iRB_ | IRS | We [| cS [ORB | oRS | ODE | OCs | 


Function 
If 0, don't change output DMA settings, dmac[1,2], when the dmac register is loaded. If 1, 
change output DMA settings when the dmac register is loaded to new values specified in bits 
1,2. 
If 0, disable output DMA. If 1, enable output DMA and request DMA cycle when OBE is 
asserted, obuf is empty. 
If 0, stop output DMA when the number of DMA transfers (OCOUNTER) reaches the number 
specified in OCNT. If 1, reload the current output DMA pointer and the current output transfer 
count registers, ODP and OCNT, respectively, when the number of DMA transfers 
(OCOUNTER) reaches the number specified in OCNT and keep output DMA enabled. 
If 0, request output DMA cycles based on OBE and dmac[1]. If 1, request output DMA cycles 
based on OBE, dmac[1], and if the DSP3210 has ownership of the bus interface. If the 
DSP3210 is not the current bus master, the BRN is asserted. When the bus is granted, the 
DMA cycle is requested. 
If 0, don't change input DMA settings, dmac[4,5], when the dmac register is loaded. If 1, 
change input DMA settings when the dmac register is loaded to new values specified in bits 4,5. 


If 0, disable input DMA. If 1, enable input DMA and request DMA cycle when IBF is aserted, 
ibuf is full. 


If 0, stop input DMA when the number of DMA transfers (ICOUNTER) reaches the number 
specified in ICNT. If 1, reload the current input DMA pointer and the current input transfer count 
registers, IDP and ICNT, respectively, when the number of DMA transfers (ICOUNTER) 
reaches the number specified in ICNT and keep output DMA enabled. 

If 0, request input DMA cycles based on IBF and dmac{4]. If 1, request DMA cycles based on 
IBF, dmac{4], and if the DSP3210 has ownership of the bus interface. If the DSP3210 is not 
the current bus master, the BRN is asserted. When the bus is granted, the DMA cycle is 
requested. 
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9.3 Timer 


The timer is a programmable 32-bit counter and associated control register and logic that can be used for interval 
timing, rate generation, event counting, or waveform generation. The input to the timer can be derived from the 
DSP3210 clock or come from an external source. The output of the timer can generate a maskable interrupt or be 
selected as an output of the chip to drive external hardware. The count-down timer can be configured to count to 
zero once, or to count continuously by automatically reloading the counter with its initial value when it reaches 
zero. 


9.3.1 Timer Programming Model 


Access to the timer is provided by reading and writing the tcon and timer MMIO registers. When the timer is 
written, the initial count register and counter are loaded. When the timer is read, the current counter value is read. 
The count value may be read or changed at any time during operation. The tcon register is used to configure and 
control the timer. Table 9-4 describes the function of each bit of the tcon register. On reset, all bits of the tcon 
register are set to zero and the timer is set to all 1s. The address for the timer registers is given for big and little 
endian byte ordering. The byte ordering is specified by pcw[8]. The register programming model for the timer is: 


Register Address Address Register Bits 
Name Big Endian _Litttle Endian 
31 


7 0 
tcon 0x0413 0x0410 Co] Rw 
timer oot ote 


9.3.2 Timer Functional Description 


The timer consists of an 8-bit control register (tcon), the initial count register, the counter, and associated control 
logic (source selection, counter control, and output control), see Figure 9-13. The source selection logic is used 
to select the clock source for the counter and is based on the value of the tcon register. The source can be a 
divide down of the DSP system clock, CKI, or the source can be the external signal applied to the pin BIOO, Bit 
V/O 0. Divide-down rates for CKI of 2, and 4 can be selected. The true or complement of the external source, 
BIOO, can be selected. The maximum frequency of an external source is CKI/2. The source selection logic 
detects zero-to-one transitions of the selected clock source. 


The counter control logic controls the loading and decrementing of the counter. The counter is loaded when the 
timer is written, and when the count reaches zero if tcon[1] = 1 (automatic reload of counter). If the counter is 
loaded when it is enabled, the loading of the counter takes preference over decrementing. The counter is 
decremented by one for each zero-to-one transition of the clock source detected by the source selection logic. At 
any time, the counter can be stopped and the count held by setting tcon[0] to a zero. Setting tcon[0] to a one at 
this time will continue the count-down operation from the current count. 


The output control logic controls the type of output signal generated by the counter. Whenever the count reaches 
zero, an interrupt request is generated, the state of a toggle flip-flop changes state, and a pulse, one CKI period 
wide, is generated. In order for the DSP to respond to the interrupt, the timer interrupt bit in the emr register, 
emr[9], must be set to one (unmasked) (see Table 7-2 emr Register Encoding). Depending on the setting of 
tcon[2], either the output of the toggle flip-flop or the pulse will be available to be observed external to the DSP via 
pin BIO1. For this, pin BIO1 has to be configured as an output and tcon[4] has to be set to one. If the pulse mode 
is selected (tcon{2] = 0), the pulse can be active high or active low, depending on tcon[3]. The active high signal 
is normally low, and when the count reaches zero it will pulse high for one clock period. The active low signal is 
normally high, and when the count reaches zero it will pulse low for one clock period. 


Use of the timer usually starts with loading the initial count register and the counter. This is done by loading the 
timer register. After loading the counter, the count-down operation is enabled by setting tcon[0] to a one. When 
the count reaches zero and if tcon[1] is set to a one, the counter is reloaded automatically from the initial count 
register, without the need for program intervention. The period of the count is N+1, where N is the value loaded in 
the counter. Alternatively, when the count reaches zero and if tcon[1] is set to a zero, the counter stops. In this 
case, the counting interval is N, where N is the value loaded into the counter. 


9-21 





DSP3210 Information Manual 


By setting the counter to zero and configuring it to reload continuously, it is possible to generate a clock, 
observable externally at the BIO1 pin, which has either a CKI/2 or a CKI/4 rate. For example, a setting of TCON 
to 0x13 will result in an external clock with a rate of CKI/2 and 50% duty cycle; a setting of TCON to 0x17 
generates an external clock with a rate of CKI/4 and 50% duty cycle; a setting of TCON to 0x33 generates an 
external clock with a rate of CKI/4 and 75% duty cycle (3 CKI periods high, 1 CKI period low); and a setting of 
TCON to 0x3b will give an external clock with a rate of CKI/4 and 25% duty cycle (1 CKI period high, 3 CKI 
periods low). 


MMIO BUS 


= 


= 
a LS Pa ee Spe 














INTERRUPT 
ae CONTROL COUNTER (32) ple 
cKI | 4 | — | (101) 






Figure 9-13 Timer Block Diagram 
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Table 9-4 TCON Register Encoding 


Bit 76 a5 4 3 2 1 0 
Field TPN] RSN_ E/N | 


Hold/enable count-down operation of counter. 

0 - Hold the count. 

1 - Enable the count-down operation of counter. 

Select between continuous and one-time operation. 

0 - Stop counting when the count reaches zero. 

1 - Automatic reload of the counter from the initial count register when the count reaches 
zero. 

Select pulse or toggle operation of the output when the count reaches zero. 

0 - For the external output, a pulse is generated when the count reaches zero. 

1 - For the external output, the output is the output of a toggle flip-flop which is toggled 
when the count reaches zero. 

When tcon[2] is set to zero, pulse, select low active or high active pulse. 

0o- es tcon|2] is set to zero, the pulse is one CKI clock period wide and is low active (1- 
>0->1). 

1 - When tcon(2] is set to zero, the pulse is one CKI clock period wide and is active high 
(0->1->0). 

If the BIO1 pin is configured as an output, 

0 - The BIO1 pin has the value specified by the BIO1 field in the BIO register. 

1 - The BIO1 pin is selected to be the timer output. 

Selects the clock source for the timer. 

tcon 7.6.5 

000 Counter clock is CKI/2 

001 Counter clock is CKI/4 

010 Reserved 

011 Reserved 

100 Counter clock is from external source(pin BIO0) - rising edge 

101 Counter clock is from external source (pin BIOO) - falling edge 

11x Reserved 
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9.3.3 Timer Timing Specifications 


When BIO0 or BIO! are selected to be used as the timer clock source or the timer output, respectively, the 
following timing specifications apply. BIOO is an asynchronous input and is synchronized to the DSP3210 by an 
internal version of CKI. If an input makes a transition during the sample window, the level recognized by the 
processor is not predictable; however, the processor always resolves the latched level to either a logic one or zero 
before using it. Figure 9-14 shows the sampling window for BIOO and the timing relationship between BIO1 and 
CKI. BIO1 changes on the falling edge of CKI. The specifications for BIOO when it is asynchronous to the 
DSP3210 are that the period (tBIOOHBIOOH) must be greater than twice the CKI period, and the high or low time 
(tBIOOHBIOOL and tBIOOLBIOOH) must be greater than one CKI period. 














cxI VIM (1.5 V) — 
ICKILBIO1X 
BIO1 Nou 
vol — 
tCKILBIOOX 
tBIOOVCKIL 
ViH — 
BIOO 


(SYNCHRONOUS) VIL — 


SAMPLE 
WINDOW 


tBIOOHBIOOH 


tBIOOLBIOOH 





tBIOOHBIOOL 


BIOO VIM (1.5 V) — 
(ASYNCHRONOUS) 


Figure 9-14 Timer Timing Specifications 


9.4 Bit I/O (BIO) 


The BIO is a general-purpose 8-bit input/output port. It includes features that make it suitable for board-level 
status signal generation and control signal testing by the DSP3210. The BIO interface consists of 8 V/O lines, any 
of which can be independently configured as an input or an output. Outputs can be written with a1 or a0, 
toggled, or left unchanged. Inputs can be directly read and loaded into a CAU register and then tested. 
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9.4.1 BIO Programming Model 


Access to the BIO is provided by reading and writing the bloc and bio MMIO registers using CA Data Move 
Instructions (e.g. *0x0418 = (byte) r1, r1 = (short) *0x041C, ...). The bioc register is used to configure each bit of 
the BIO as an input or an output. Table 9-5 describes the function of each bit of the bioc register. The bio register 
is used to alter the state of bits configured as outputs or to determine the logical state of BIO inputs. Table 9-6 
describes the function of each bit of the bio register. On reset, all bits of the bioc register and output register bits 
of the bio register are set to zero. The register programming model for the BIO is as follows: 


Register Address Address Register Bits 
Name Big Endian Little Endian 
31 16 15 87 0 
bioc 0x041b 0x0418 Co) Rw 


bio 0x041e 0x041C [es a ee 


Table 9-5 bioc Register Encoding 
Bit iz. 6 5 4 3 rh 1 0 
Let i Ee i (a (Re ee (| 


Each bit in the bioc register corresponds to one of the pins in the BIO port, BO—BIOO, B1—BI01, ..., B7—BIO7. 
A value of 0 in the bit indicates that the associated pin is configured as an input; a value of 1 indicates that the 
pin is configured as an output. On reset all bits are set to zero (input). 


Table 9-6 BIO Register Encodings (Writing and ae 


Bit 15 14 13 12 1110 
Field [ge eee ors tare} BO] 


Each pair of bits (field) in the bio register corresponds to one of the bio pins in the BIO port, BFO-BIOO, 
BF1—BIO1, ..., BF7—BIO7. When the bio register is written, the value of each output register bit is changed 
according to the following encoding of the bit field: 


BF bit pair: bit 1.0 
00 Keep the previous value in the bio output register bit 
01 Set the bio output register bit to zero 
10 Set the bio output register bit to one 
W Complement the value in the bio output register bit 


The transfer function is described below, where Qt is the previous value of an output register bit and Qt+1 is the 
new value: 


CSS SS eS 
() t) t) C) 
0 t) 1 0) 
t) 1 () 1 
C) 1 1 1 
1 () C) 1 
1 C) 1 () 
1 1 () 1 
1 1 1 () 


When the bio register is read, the value of both the corresponding bio pin and the bio output register bit are read 
according to the following encoding of the bit field: 


bit 0 Bio pin value 
bit 1 Value of the bio output register bit 
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Figure 9-15 Bit /O Bit Slice Diagrams 


9.4.2 BIO Functional Description 


Figure 9-15 shows bit slices for the two types of bits in the BIO. The structure for Bit 0 and bits 2—7 are the 
same, except that the BIOO pin can be used as the timer source and BIO7—4 can be used to input the reset 
values of pew[10—13]. Bit 1 has the additional capability of serving as the timer output, therefore an additional 
multiplexer controlled by tcon[4] is needed in this bit slice. 


Each bit of the BIO port can be configured independently as either an input or an output by writing the desired 
value in the bioc (bit /O configuration) register (see Table 9-5). If the pin is configured as an output, values 
written to the bio register are immediately reflected to the pin. The value of each output pin can be changed by 
writing the bio register with an appropriate pattern as described in Table 9-6. Each bio output register bit can be 
set to a 1 or a0, toggled, or left unchanged. When the bio register is read, both the logic value of the pin and the 
value of the output register bit (the value that would be driven to the pin if the pin were configured as an output) 
are read. 


9.4.3 BIO Timing Specifications 


When configured as inputs by the bioc register, BIO pins are asynchronous inputs that are synchronized to the 
DSP3210 by an internal version of CKI. If an input makes a transition during the sample window, the level 
recognized by the processor is not predictable; however, the processor always resolves the latched level to either 
a logic one or zero before using it. Figure 9-16 shows the sampling window for BlO inputs. When configured as 
outputs, BIO pins change with respect to the falling edge of CKI. The timing relationship between the BIO outputs 
and CKI is also shown in Figure 9-16. 
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Figure 9-16 BIO Timing Specifications 
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10. INSTRUCTION AND REGISTER ENCODINGS 


10.1 Instruction Set Encodings 
Tables 10-1 to 10-3 illustrate the bit-level encoding of the DSP3210 instruction set. 
Table 10-1 Instruction Encodings 


ALE BRGOe BSG SH ee 876543210 
foa & 1a. Reserved | [ooo 0 of | | 














fb. Do instr(mmes) [4 0 0 0 1 tJolelmM] | TT TT « fob 






































éb. 32-bit ALU Reg Sc] 10 0 1 1 O]0 





6c. 16-bit ALU immed [0 0 0 1 1 0|1 F rD 
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Table 10-1 Instruction Encodings (cont.) 


31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 151413121110 9 8 7 6 5 43 21 0 


castmnummed]1 00114 0/1] F | 
awove-on Joo 6 1 4 4]0[T 
























































5.DAU-Spec.Func]O 41 1 1 1] [n Joooo0000 
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Table 10-2 CA Instruction Field Encodings 






























[Fe | atusFunetion | 
+ addition 

<<n shift left by n 

- subtract {N} - rD 

# carry reverse add 

- subtract rD - {N, rS} 
reserved 

&~ and with complement 
- compare (no store) 

* exclusive or 

>>>1 rotate right by 1 

| bitwise OR 

<<<1 rotate left by 1 

>>n logical shift right by n 
$>>n_ arithmetic shift right 
byn 
& bitwise AND 


unsigned char 
signed char 
unsigned short 


signed short 
unsigned high 
char 

reserved 
reserved 

long 





Reserved 
ns 
ri6 
7 
ns 
ng 
n 
+n 
120 
r21 
122 

Reserved 

Reserved 

Reserved 

pesh 

Reserved 






CA-T Field 


0 Data is moved to a register 
1 Data is moved from a register 





CA - R Field 
[ R_[voRegister] R_ | VORegister_| 
ps spc 
Reserved Reserved 


Reserved pew 
Reserved Reserved 
emr dauc 
Reserved ctr 
Reserved 
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Table 10-2 CA Instruction Field Encodings (cont.) 


C Field - CAU & DAU Condition Codes 
[Code [Flags] Meaning 


none always true 
result is nonnegative (plus) 


peeded result is negative (minus; 
Ges eee 
result equal to zero 
overflow clear, no overflow 
overflow set, overflow 
c: carry clear, no carry 
carry set, cari 


less than zero 
greater than zero 
z\(n4v)=1 less than or equal to zero 
greater than (unsigned) 
clz=1 less than (unsigned) 
underflow clear, no underflow 
underflow set, underflow 































overflow set, overflow 
less than or equal to zero 
Reserved 


C Field - /O Condition Codes 
Condition [__t Meaning» 1 _ saver 


0 input buffer empty 
ibf=1 input buffer full 
Reserved 
1 
1 















g 


o 
ty 

x|+ 4]= alo oJo o]= 3=|= =|Joaloo 

x] alo of— =Jo o]= =Joo]= =|oo 

x] of2 of of= of=0]20]20]a0 


ibf 
obf 





Reserved 


oO output buffer FULL 
obe=1 output buffer emp 
|| Reserved 


g 
Reserved 

Sy= SY (W/O sync) set 
Serial /O frame boundary clear 

fo: Serial /O frame boundary set 
Interrupt Request 0 clear (logic 0) 
Interrupt Request 0 set (logic 1 
Interrupt Request 1 clear (logic 0) 
Interrupt Request 1 set (logic 1 


Gear (LE Sa a) 


irtc 
irls 





oolooloolooloojooloajoad 
x|+4]= =]= =|= =|Joolocolooloo 
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Table 10-3 DA Instruction Field Encodings 






DA -G Field 
za Operation 




















ic 
oc 
floati6 
intt6 
round 
ifalt 
ifaeq 
ifagt 

float32 
int32 
Reserved 
Reserved 
ieee (DSP to IEEE) 
dsp (IEEE to DSP) 

seed 













Reserved 


DA-M Field 
| M | Operation 
000 ad 
001 al 


010 a2 


011 a3 


110 Not Allowed 
111 Not Allowed 





















DA - X, Y, Z Fields 
Le | operation | 


Selects register-direct 


no 
nm 
m2 
n3 
m4 
Not allowed 


DA -N Field 
Tn | operation | 
00 


ad 
01 al 
10 a2 
"1 a3 
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DA-X, Y, Z Fields 
| 1 [Operation (p+0000) 


0 
ns 
né 
n7 
ns 
ng 
-A(f), -2(i), -1(b) 
+4(f), +2(i), +1(0) 


DA- X,Y,Z Field 
| M_ | Operation (p=0000) 


a0 -X, Y fields only 
a1 -X, Y fields only 
a2—-X, Y fields only 
a3 —X, Y fields only 



























Reserved 
Reserved 
Reserved 
No write - Z field only 


DA - S and F Fields 
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10.2 Register Encodings 


Tables 10-4 to 10-8 illustrate the bit-level encoding for the IO registers of the DSP3210. Tables 10-9 to 10-12 
illustrate the bit-level encoding for the MMIO registers of the DSP3210. 
Table 10-4 ps Register Encoding 


Bit] 15 | 14 | 13 | 12 | 11 | 10-| 9 | 8 
Field[_o To [ int [ iro | fo T sy | obe [ ibf | 





CAU n flag; means that the ALU result is negative. 

CAU z flag; means that the ALU result is zero. 

CAU v flag; means that the ALU result overflowed. 

CAU ¢ flag; means that the ALU result had a carry out of the MSB. 
DAU N flag; means that the DAU result is negative. 

DAU Z flag; means that the DAU result is zero. 

DAU U flag; means that the DAU result underflowed. 

DAU V flag; means that the DAU result overflowed. 

Serial input buffer full flag; means that the serial input buffer is full. 
Serial output buffer empty flag; means that the serial output buffer is empty. 
Serial sync set flag; means that the SY pin is high (logic one). 

Serial frame boundary flag; means that the SIO is at a frame boundary. 
Pin IRON is high (logic one). 

Pin IR1N is high (logic one). 

Reserved - read as zero. 





. 


Table 10-5 dauc Register Encoding 
Bit re 6 5 4 3 2 1 o 
Field[___ 0 |, COND ROUND CONV 


[ Bit(s) [Field [Encoding [Function 
j-law input conversion. 

A-law input conversion, 

Unsigned linear byte input conversion. 
p-law output conversion. 

A-law output conversion. 

Unsigned linear byte output conversion. 
Round-to-nearest on float-to-integer conversions. 
Truncate towards -co on float-to-integer conversion. 
Truncate towards 0 on float-to-integer conversion. 
DA instructions /falt, ifaeq, and ifagt have unconditional Z-writes. 
DA instructions /falt, ifaeg, and ifagt have conditional Z-writes. 


Must be set to zero; Read as zero. 
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Table 10-6 ctr Register Encoding 


Bit 7 


6 5 4 3 1 ) 
Field [isos sn | ae Ons | eN-Biw fis eN Ace] SN-G ie oo 





Current DAU N Flag 

DAU N Flag computed for Instruction —1 
(previous DA instruction that set flags) 
DAU N Flag computed for Instruction -2 
DAU N Flag computed for Instruction -3 
DAU N Flag computed for Instruction —4 
DAU N Flag computed for Instruction -5 
Reserved - read as zero 





Table 10-7 emr Register Encoding 




















Bit| 15 14 132 12 1 10 9 8 cd 6 5 4 3 2 1 0 
Field OFAM] IFAM] OSE | 15 | Res | TIMER [EXTO| Res | NAN] UW [AERR 


Each bit in this register corresponds to one of the maskable sources for an exception. A value of 1 in a position 
enables the corresponding interrupt source; a value of 0 disables the source. On reset all bits are set to 0 (zero). 


[ Bits) | Field __| 


eee 












If 1, select processor level to be visible 
on IACK[1—0]. Encoding: 

00 - Base level 

01 - Interrupt level 

10 - Error level 

11 - Double error level 
Reserved 
Addressing Error 
DAU Underflow/Overflow 
IEEE NAN 
Reserved 
Interrupt pin 0 (IRON) request 
TIMER counted to zero 
Reserved 
SIO input buffer full 
SIO output buffer empty 
SIO input DMA frame complete 
SIO output DMA frame complete 
Interrupt pin 1 (IR1N) re 





















Highest 
| 
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Table 10-8 pew Register Encoding (cont.) 


10-8 


0-or-more wait-states 
1-or-more wait-states 
2-or-more wait-states 
3-or-more wait-states 


0-or-more wait-states 
1-or-more wait-states 
2-or-more wait-states 
3-or-more wait-states 
These bits configure the page size for asserting the page break detect (PBD) signal. A2—A9 
are always masked, A17—A31 are always unmasked, A10—A16 masking is controlled by the 
PBD field when comparing the last address to external memory. Set to {0,0,0} on reset. 
pew 6.5.4 Page Size DRAM size 
000 256 words 64k xn 
512 words 256k xn 
1k words 1Mxn 
011 2k words 4Mxn 
4k words 16Mxn 
8k words 64M xn 
16k words 256M xn 
32k words iGxn 
This bit enables(1)/disables(0) sequential address break detection and can be used with a 
page size of 256 only. Set to one on reset. 
Configures the byte ordering. If 0, little endian byte ordering is selected. If 1, big endian byte 
ordering is selected. Set to one on reset. 
Configures the MSO—MS3 pins. If 0, Intel style signaling is selected. If 1, Motorola style 
signaling is selected. Set to one on reset. 
Selects between one of two memory maps. If 1, 1Computer mode is selected and on-chip 
memory is at location zero. If 0, wProcessor mode is selected and external memory is at 
location zero. Reset value based on the state of bio[7] during reset. 
Configures the RW pin. If 0, RW pin is active high for write transactions. If 1, RW pin is active 
high for read transactions. Reset value based on the state of bio[6] during reset. 
Configures the operation of pins AEN/MRN and DEN/MWN. If 0, SRAM interface signaling 
selected (MRN and MWN). If 1, DRAM system bus interface signaling selected (AEN and 
DEN). Reset value based on the state of bio[5] during reset. 
Reset value based on the state of bio[4] during reset. Controls Boot ROM routine. Also, when 
written with a one, modification of the pew is disabled. 
Configures the PBD/BLMN signal. Set to {0,0} on reset. 
BLMN is selected 
PBD is selected; addresses to partition A are monitored. 
PBD is selected; addresses to partition B are monitored. 
PBD is selected; addresses to partitions A and B are monitored. 
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Table 10-9 loc Register Encoding 


If 0, SY is external. If 1, SY is internal. When generated internally, SY = {ICK, OCK}/{256, 512, 
1024}, based on ioc[1] and ioc[3,2]. 

If 0, ICK is used to derive the internal load and SY signals. If 1, OCK is used to derive the internal 
load and SY signals. 

These bits select the frequency ratio of the on-chip load signal to the on-chip SY signal. 


ratio = 8 

ratio = 16 

ratio = 32 
If 0, ICK is external. If 1, ICK is generated internally with a frequency specified by ioc[23—20]. 
If 0, ILD is external. If 1, ILD is generated internally with a frequency of {ICK,OCK}/32, based on 
ioc{1]. 
These bits specify the length of the serial input data. 


input length is 32 bits (prior to ILD) 
input length is 8 bits (after ILD) 
input length is 16 bits (after ILD) 
input length is 32 bits (after ILD} 
if 0, OCK is external. If 1, OCK is internally generated with a frequency specified by ioc[23—20]. 


If 0, OLD is external. If 1, OLD is internally generated: (1) with a frequency of {ICK,OCK}/32, based 
on ioc[1] if ioc[13]=0, or (2) in a burst-mode if ioc[13]=1. 
the | 


output length is 8 bits 

output length is 16 bits 

output length is 32 bits 

output length is 24 bits 

reserved 

reserved 
If 0, the internal OLD is {ICK,OCK}/32, based on ioc]. If 1, the internal OLD is burst-mode, 
generated as a function of the status of the obuf and OSR. 
If 0, the ICK is true (not inverted), If 1, the ICK is inverted. 
If 0, clear sanity bit. If 1, set sanity bit. 
If 0, the LSB is received first during serial inputs. If 1, the MSB is received first during serial inputs. 
If 0, the LSB is transmitted first during serial outputs. If 1, the MSB is transmitted first during serial 
outputs (can't be used with 24-bit output length). 
lf 0, the input DMA transfer size is 32-bits. If 1, the input DMA size is specified by ILEN. 
If 0, the output DMA transfer size is 32-bits. If 1, the output DMA size is specified by OLEN. 
These bits specify the divide-down rate of the internal clock generator. The internal clock generator 
has an additional fixed divide-down by four, i.e., if bits 23—20 represent N, the divide-down rate is 
N*4, If ICN = 0, the active clock generator is disabled. 
Reserved; Read as zero. 
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Table 10-10 dmac Register Encoding 
Bit 7: 6 5 4 3 2 1 it) 
Field[_IRB__| IRS | WDE [| ICS [ORB | ORS [| ODE | OCS | 


If 0, don't change output DMA settings, dmac[1,2], when the dmac register is loaded. If 1, 
change output DMA settings when the dmac register is loaded to new values specified in 
bits 1,2. 


If 0, disable output DMA. If 1, enable output DMA and request DMA cycle when OBE is 
asserted, obuf is empty. 


If 0, Stop output DMA when the number of DMA transfers (QCOUNTER) reaches the 
number specified in OCNT. If 1, reload the current output DMA pointer and the current 
output transfer count registers, ODP and OCNT, respectively, when the number of DMA 
transfers (OCOUNTER) reaches the number specified in OCNT and keep output DMA 
enabled. 

If 0, request output DMA cycles based on OBE and dmac[1]. If 1, request output DMA 
cycles based on OBE, dmac[1], and if the DSP3210 has ownership of the bus interface. If 
the DSP3210 is not the current bus master, the BRN is asserted. When the bus is 
granted, the DMA cycle is requested. 

If 0, don't change input DMA settings, dmac[4,5], when the dmac register is loaded. If 1, 
change input DMA settings when the dmac register is loaded to new values specified in 
bits 4,5. 


If 0, disable input DMA. If 1, enable input DMA and request DMA cycle when IBF is 
aserted, ibuf is full. 


If 0, stop input DMA when the number of DMA transfers (ICOUNTER) reaches the number 
specified in ICNT. If 1, reload the current input DMA pointer and the current input transfer 
count registers, IDP and ICNT, respectively, when the number of DMA transfers 
ICOUNTER) reaches the number specified in ICNT and keep output DMA enabled. 
If 0, request input DMA cycles based on IBF and dmac{4]. If 1, request DMA cycles based 
on IBF, dmac[4], and if the DSP3210 has ownership of the bus interface. If the DSP3210 
is not the current bus master, the BRN is asserted. When the bus is granted, the DMA 
cle is requested. 
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Table 10-11 tcon Register Encoding 


Bit 7 6 5 4 3 2 1 0 
Field T/PN R/SN 


E/HN | Hold/enable count-down operation of counter 
0 - Hold the count 
1 - Enable the count-down operation of counter 


Select between continuous and one-time operation 
0 - Stop counting when the count reaches zero 
1 - Automatic reload of the counter from the initial count register when the 
count reaches zero 
T/PN_ | Select pulse or toggle operation of the output when the count reaches zero 
0 - For the external output, a pulse is generated when the count reaches 
zero 
1 - For the external output, the output is the output of a toggle flip-flop which 
is toggled when the count reaches zero 
H/LN_ | When tcon[2] is set to zero, pulse, select low active or high active pulse 
0 - When tcon[2] is set to zero, the pulse is one CKI clock period wide and is 
low active (1->0->1) 
1 - When tcon[Q] is set to zero, the pulse is one CKI clock period wide and is 
active high 


OUT | If the BIO1 pin is configured as an output, 
0 - the BIO1 pin has the value specified by the BIO1 field in the BIO register 
1 - the BIO1 pin is selected to be the timer output 
SRC _ | Selects the clock source for the timer 


Counter clock is CKI/2 

Counter clock is CKI/4 

Reserved 

Reserved 

Counter clock is from external source(pin BIOO) - rising 


edge 

Counter clock is from external source (pin BIOO) - falling 
edge 

Reserved 
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Table 10-12 bioc Register Encoding 
Bit 7 6 5 4 3 2 1 tt) 
Fleid|_apra_ | _mpet | spo) | sb42 | eas | Ba [ptr [Bo 


Each bit in the bioc register corresponds to one of the pins in the BIO port, BO—BIOO, Bi—BI01, ...,B7—BIO7. 
A value of 0 in the bit indicates that the associated pin is configured as an input; a value of 1 indicates that the pin 
is configured as an output. On reset all bits are set to zero (input). 


Table 10-13 bio Register Encoding 


Bit|_ 1514] 13 12 abl 10 9 8 i 6 5 4 3 2 1 0 
Field 
Each pair of bits (field) in the bio register corresponds to one of the bio pins in the BIO port, BFO—BIOO, 


BF1i—BIO1, ..., BE7—BIO7. When the bio register is written, the value of each output register bit is changed 
according to the following encoding of the bit field: 


BF bit pair: bit 1.0 
00 Keep the previous value in the bio output register bit 


01 Set the bio output register bit to zero 
10 Set the bio output register bit to one 
1 Complement the value in the bio output register bit 


The transfer function is described below, where Qt is the previous value and Qt+1 is the new value: 
i Bito Ot+1 





tt) 0 0 . 0 previous 
te) te) 1 0 false 

tt) 1 i) 1 true 

0 1 1 1 toggle 

1 0 0 1 previous 
1 0 1 0 false 

1 1 0 1 true 

1 1 1 0 toggle 


When the bio register is read, the value of both the corresponding bio pin and the bio output register bit are read 
according to the following encoding of the bit field: 


bitO Bio pin value 
bit 1 Bio output register value 
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11. PHYSICAL SPECIFICATIONS 
11.1 Pin Assignments............ 
Table 11-1 Pin Descriptio! 









11. PHYSICAL SPECIFICATIONS 


DSP3210 132-Pin PQFP (Plastic Quad Flat Package) 


Dimensions are in inches and (millimeters). 


1.097 (27.86)/1.109 (28.02) 
1.075 (27.30)/1,085 (27.56) 
0,947 (24,05)/0.953 (24.21) 


aaa 0.997 (23.80)/0.943 (23.95) 


AAA AAA AAA RAAB AAR ABA AAA A 
Ni a 4 






0.04270.056 













Sj = | 
= P | & 
= B= ghee 
Sj me 838: 
= Be sets 
4 B jefe 
= = sé 
= B= e388 
G &,8 
= EB 8/& 
= =p EHS 
TUVVVUTVVVVV ITTV TITTY TYTN T TTT 
7 DEGREES 0.067 
ALL ROUND 70) ray 
~~ bac | -.012 (0.30)MAX 
l 0.180 
(4.57) 


0.025 (0.64)-e1 Le 
TYP 


0.012 (0.30) mcd) 


DSP3210 Information Manual 
Physical Specifications 


0.076 (1,93). 
0.080 (2.03) TYP 
0.035 (0.889), 
MIN (TYP) 0,058 (4.47), 
0,062 (1.57) 






0,006 (0.152) 
0.015 (0.381) 


a TYP 


0.020,(0.508 
(0.030 (0.762) 


DETALA 
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11.1 Pin Assignments 


84 7,83 


O BINNO. 1 MARK 
116 51 
ie) 


Table 11-1 Pin Descriptions 























| wise 
| OD _i[R 
iS. ee eS ee 
| 0 VO) | Daibos = BOS 
Sa aa BST 6 OE PA 
7 [2 v0(3)__ [Data Bus - Bit2 
ES 


Beresa 

ae BO Sel 

On | Sees D4] PSO) ee | Data Bus Bites 
SE as Be Oe ES ea 
Sa aes 








Sa as) VoD Se So) aT ESS 
VO Data Bus - Bit 6 

D7 VO(3) Data Bus - Bit 7 

a es ee ee Re 
Riis D9 Data Bus - Bit 9 

Vss P Ground 


[APS a MDURP AT esses CPTELERESESTESES ES 
Data Bus - Bit 10 
a a Dit Data Bus - Bit 11 
a SS eee ES a ce | 





















22 013 VO(3. Data Bus - Bit 13 
23 Vop P Suppl 

SS Se Sn a al 
25 5 vO(3; Data Bus - Bit 15 








* | = Input, O = Output, P = Power, OD = Open Drain, NC = No connection, (3) = 3-state 
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Table 11-1 Pin Descriptions (continued) 


isetypo: | _—b “ewName_. seer | 
VO(3 Data Enable/Write Strobe 
[Ce ae a oc 


mae) 
DLE | _i__| DataLatch Enable 
pp 










DEN/MWN 


VO(3 Data Bus - Bit 16 
17 VO(3 Data Bus - Bit 17 
[| VO(3)__| 


is] 


=| < DIO} <|<|a Ol<|gjglo}< Oo} o}<}<) <| 
‘) 
Nc [i TI TTT fill 


[_vo(3)__ 
VO(3 
Dig 
et 2 Se a 
a ee 
Data Bus - Bit 22 
025 
aay (Ce a a 
D26 
BL CC ar a 
D30 
| OS): ReagWirte 
fe EE CN i 
ee a ae 
BGACKN | O(3)_| Bus Grant Acknowledge 
[=O(S)F=ail(Gyca State =A 
Bares Se aes 
BES Tl a) 
[1 [Synchronous Ready 
Sam (i as 
aes ht aaa eas 
LOCKN BOS i 
PBD/BLMN 
[O(3) [Memory Select3_ 
MS2 | 0(3) [Memory Select2 
MS1 | 0(3) [Memory Select? 
MSO [0(3) [Memory Selecto_ 


* | = Input, O = Output, P = Power, OD = Open Drain, NC = No connection, (3) = 3-state. 


fe) 
a) 


1} 
foo} 
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Table 11-1 Pin Descriptions (continued) 


67 
ddress Bus - Bit 31 
\ddress Bus - Bit 30 
ddress Bus - Bit 29 
\ddress Bus - Bit 28 
ddress Bus - Bit 27 








>|> 


>| 


SN) 
N]=|S) 


ol pe) 
EN (7) 






Address Bus - Bit 26 
Address Bus - Bit25 
Address Bus - Bit 24 
Address Bus - Bit 23 
Address Bus - Bit 22 
ddress Bus - Bit 21 
\ddress Bus - Bit 20 
ddress Bus - Bit 19 


| 


| 


ssn] 51 
bard £5) f=) Po) BN) Fe) (a) 


>|> 


ddress Enable/Read Strobe 
\ddress Bus - Bit 18 


i 





> 


\ddress Bus - Bit 17 
ddress Bus - Bit 16 
ddress Bus - Bit 15 
\ddress Bus - Bit 14 


DI>| 


i 
>| 


\ddress Bus - Bit 13 
ddress Bus - Bit 12 
ddress Bus - Bit 11 
ddress Bus - Bit 10 


* |= Input, O = Output, P = Power, OD = Open Drain, NC = No connection, (3) = 3-state. 





| 


> 


> 
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Table 11-1 Pin Descriptions (continued) 


Address Bus - Bit 9 
pp 

Ground 

Address Bus - Bit 8 
Address Bus - Bit 7 
Address Bus - Bit 6 
Address Bus - Bit S 
Address Bus - Bit 4 
Address Bus - Bit3 






lock In 
it VO - Bit 7 
VO(3) it VO - Bité 


a ee 
BIOS 10) it VO - BitS 
BIO4 VO(3' it VO - Bit 4 


Bae 
[| __O(3) __| Address Bus - Bit2 
ae ees 
| vis) 











[4 BIOs 0(3)_[ Bit VO - Bit 3 
it VO - Bit 2 
a a ee 
SSS ae er el 
[ie BOTT 03)_[ Bit vO - Bit1 
[19 BIO 03)_[ Bit VO - Bit 
ae ee Be 
[Ser 2S |e eI | Serial Detain: 
[| _i24 DO 3)_— Serial Data Out 
p 
a een ee Ea) 
p 
Output Load 
Interrupt Acknowledge 0 
ea 1ACKi O38} Interrupt Acknowledge 1 
Sara IRON I Interrupt Request 0 

132 Vpb P Suppl 


* | =Input, © = Output, P = Power, OD = Open Drain, NC = No connection, (3) = 3-state. 
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12. ELECTRICAL AND TIMING SPECIFICATIONS 
12.1 Electrical Specifications... 

Table 12-1 Electrical Requirements and Character 
12.1.1 Maximum Ratings .. 
12.2 Timing Specifications. 

Figure 12-1 CKI Timing Speci 

Table 12-2 Timing Requirements for CKI. 

Table 12-3 Timing Requirements for Synchronous Bus Interface Inputs 

Table 12-4 Timing Characteristics for Synchronous Bus Interface Outputs. 

Table 12-5 Timing Relationships for Synchronous Bus Interface Operatio 

Figure 12-2 Synchronous Bus Interface Timing... 

Figure 12-3 Asynchronous Bus Interface Timing 

Table 12-6 Timing Relationships for Asynchronous Bus Interface Operation 

Figure 12-4 Bus Arbitration Timing. 

Table 12-7 Timing Characteristics for Bu: 

Figure 12-5 Serial Input Timing ......... 

Table 12-8 Timing Requirements for Serial Inpu' 

Figure 12-6 Serial Output Timing 

Table 12-9 Timing Renee for Serial Outputs 

Table 12-10 Timing Characteristics for Serial Outputs 

Table 12-11 Timing Requirements for Serial Clock Generation. 

Table 12-12 Timing Characteristics for Serial Clock Generatior 

Figure 12-7 ac Serial Clock Generator Timing 

Figure 12-8 Bit /O Timing ... 

Table 12-13 Timing Requirements for Bit 

Table 12-14 Timing Characteristics for Bit /O 

Figure 12-9 Interrupt Timing... 

Table 12-15 Timing Requirements for Interrupts. 

Table 12-16 Timing Characteristics for Interrupts 

Figure 12-10 RESTN. 

Table 12-17 Timing Ri for Reset 

Table 12-18 Timing Characteristics for Rese’ 

Figure 12-11 Power-on Reset Timing........ 

Figure 12-12 Externally Controlled Reset and ZN Timing. 
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12. ELECTRICAL AND TIMING SPECIFICATIONS 


12.1 Electrical Specifications 


Table 12-1 Electrical Requirements and Characteristics 
The parameters below are valid for the following conditions: TA = 0 to 70 °C; VoD= 5 V+ 5%; Vss = 0 V 


Description [ Symbol | 


[Min 
Input Voltage except CKI: Low VIL rege | 
High VIH 




















Input Voltage for CKI: Low VILC 
High VIHC 









Output Voltage: (IOL = 6 mA) Low (TTL) VOL 
(IOL = 5 pA) Low (CMOS) VOL 
(TTL) VOH 






High (CMOS) VOH 
Output Current: (VOL = 0.4 V) Low 
VOH = 2.4 V) High 1OH 
Input Leakage except ZN: a 
a 1H 
Input Leakage for ZN: (VIL = 0 V) Low 
‘VIH = 5.5V) High 1H 
Output Hi-Z Current: (Vapplied = 0.0 V) Low 
‘Vapplied = 5.25 V) Hig 10ZH 
Input, Output, IOput capacitance 
Power Supply Current Vop = 5.25 V, T = 18 ns | 10D | 
Power Dissipation Vo = 5.25 V, T = 18 ns feAPD SI 
*  Currentin the input buffers is highly dependent on the input voltage level. At full CMOS levels, essentially no DC current is drawn, but for 
levels near the threshhold of 1.4 V, high and unstable levels of current may flow. There are 56 input buffers on the chip. If all inputs are 
connected to a de voltage of 1.4 V, an additional current in the range of 100 mA can be drawn. 


+ The power dissipation listed is for internal power dissipation only. Total power dissipation can be calculated based on the system 
configuration by adding C * Voo * f for each output, where C is the load capacitance, and f is the output frequency. 






























12.1.1 Maximum Ratings 


Stresses in excess of the Absolute Maximum Ratings can cause permanent damage to the device. These are 
absolute stress ratings only. Functional operation of the device is not implied at these or any other conditions in 
excess of those given in the operational sections of the data sheet. Exposure to Absolute Maximum Ratings for 
extended periods can adversely affect device reliability. 


VOLTAGE RANGE ON ANY PIN WITH RESPECT TO GROUND 
AMBIENT TEMPERATURE RANGE -—40 to +120 °C 
STORAGE TEMPERATURE RANGE .. -65 to +150 °C 


Warning: All CMOS devices are prone to latch-up if excessive current is injected to/from the substrate. To 
prevent latch-up at power-up, no input pin should be subjected to input voltages greater than VIL, or less than 
Vss - 0.5 V before VoDis applied. After power-up, input should not be greater than VDD + 0.5 V or less than 
Vss-0,5 V. 


Handling Precautions: All MOS devices must be handled with certain precautions to avoid damage due to the 
accumulation of static charge. Although input protection circuitry has been incorporated into the devices to 
minimize the effect of this static buildup, proper precautions should be taken to avoid exposure to 
electrostatic discharge during handling and mounting. AT&T employs a human body model for ESD 
susceptibility testing. Since the failure voltage of electronic devices is dependent on the current and voltage and, 
hence, the resistance and capacitance, it is important that standard values be employed to establish a reference 
by which to compare test data. Values of 100 pF and 1500 Q are the most common and are the values used in 
the AT&T human body model test circuit. The breakdown voltage for the DSP3210 is greater than 2000 V, 
according to the human body model. ESD data for the charge device model is available on request. 


0.5 to+6V 
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12.2 Timing Specifications 


The characteristics listed are valid under the following conditions: VoD = 5 V + 5%; Vss = 0 V; TA =0 to 70°C. 


DEVICE 
UNDER 
TEST 


CLOAD INCLUDES JIG CAPACITANCE 





ac Testing Load Circuit 


SETUP TIME HOLD TIME 


DELAY TIME 
HOLD TIME 


OUTPUT 





ac Testing Input, Output Waveform 
Test conditions for inputs (unless otherwise specified): 
= Rise and fall times of 4 ns or less 
= Timing reference level for setup times is VIM = 1.5V 
= Timing reference levels for hold times is VIH, VIL 
Test conditions for outputs (unless otherwise specified): 
= CLOAD = 50 pF unless otherwise stated 
= Timing reference level for delay times is VOM = 1.5 V 
= Timing reference levels for hold times is VOH, VOL 


Tables 12-2 to 12-18 are the timing specifications for the 55 MHz DSP3210. 
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VIHC (2.4 V) — 
CKI VIMC (1.5 V) —— 
VILC (0.4 V) — 


Figure 12-1 CKI Timing Specifications 
Table 12-2 Timing Requirements for CKI (See Figure 12-1) 






Pe Description [win | Wax | Unit 
[_At_[ 1CKiLCKIC | Clockin Perioo? [18 | —* | ns | 
[a2 —[ICKILCKI [Clockintow | 81 | — | ns | 
[AS | ICKICKIL[ Clock inHigh | 81 | — | ns | 
[Ad | ICKIRISE [Clock in Rise Time [| __— | 4 | ns | 
[AS | ICKIFALL [Clock InFal Time | — | 4 | ns | 


* This column in all timing requirements and/or characteristic tables refers to the internal alpha-numeric symbol in the timing diagrams. 
t ICKILCKIL is referred to as T for specifications that are a function of the period of CKI. 
+ This device is static. However, the maximum period guaranteed by test is 50 ns. 


Table 12-3 Timing Requirements for Synchronous Bus Interface Inputs (See Figures 12-2 and 12-4) 


[Num [symbol [Description 
[Bi | tSRNVCKIL__[SRDYNSetuptoCKiLow 
| B2 | tCKILSRNX_[SRDYNHoldafterCkiLow 
[83 | tBENVCKIL_[BERRNSetuptoCKiLow 
[B4 | tCKILBRNX [BERRNHoldafterCKiLow 
; 85 | 
| 86 | 
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Table 12-4 Timing Characteristics for Synchronous Bus Interface Outputs (See Figures 12-2 and 12-4) 


















[Buffer Type| Num | Delay Time Symbol | _Num_| Hold Time Symbolt | Max Load] 

La [69 frcxiov B10] rCKLDx «150 pF | 

P= amast A tcKiav | B12] tCKILAX [150 pF _| 
WS0—MS3_[|_A | B13 [tckiMsvV__| B14 [10KILMSX___| 150 pF _| 
cen] A] 815_[rcicsnv | B16 [1CKILGSNX | 50pF | 
a 

[cats [1oKasnH | |__| s0pF _| 

La et reKicrwy | 822 CKIRWX | 50pF | 

LA 83 ]rcKicpev | B24 | tCKILPBX | S0pF_| 
a a 

[827 [GKuRNH |_| | spr] 

[8 | 829 | iCKIHMWNV ae eee |e! 
[a] tCKILLONVY | 832 _[TOKILLONX | 50pF__| 

LA] 833] CKILeRNY | 834 {CKILBRNX | 50pF__| 

P 8 [aes froatpeane Ts _frraseanene fen 





* Description for all Delay Time Symbols in this table is: CK! (Low,High) to Signal (Valid, High, Low). 
+ Description for all Hold Time Symbols in this table is: Signal Value Held after CKI (Low, High). 






Synchronous Delay Times Bimies aa 
(B9,B11,...B35,B37) 

[ Buttentypeta= = Skene |eAn ee Bee SeCi} 

vom epee 

VOL a 


VOH 







Synchronous Hold Times 
(B10,B12,...B36,B38) 











(EES ee ee ee a ES 
(i Ee ae a SS SS SS Se 
(en ee ee ee eS Sa 





Note: NA = Not Applicable. 
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Table 12-5 Timing Relationships for Synchronous Bus Interface Operation (See Figure 12-2) 


Note: All signals referred to in this table are equally loaded with 50 pF. Signal timing relationships with 
respect to DATAin supercede B7 and B8 timing requirements. 


[Num] Symbol _| Description fy FE Soin eee 























tMWNHDZ Data Bus Hi-Z after Write Strobe High 0.5°*T-4 0.5*T+(N-M)*T+2 


[Max | Unit] 
[B39 [ICKILDE | DataBuslow-ZafterCKilow [| | 
Pea |1CKILOZ | DataBusHizafterCKItow | | is | 
[B41 [AVOINV | Address’ ValidtoDATAinVaid | | Tt NTT | os | 
[B42 [TAVASNL [Address Validto Address Swobetow [osst-at_[ o5*T [ns | 
[B43 [tASNLASNH [Address Strobe Width | ot T-24N*T [= ns | 
[B44 [tASNHAX | Address Hold after Address StrobeHigh | | ns | 
[B45 |tASNLDINV | Address Strobe Low toDATAinValid [| 1.5 *T-9+N°T | ns | 
[B46 |tDINASNH | DATAinSetupto Address StobeHigh [8 | ns 
[B47 [tASNHDX | DATAinHold after Address Strobe High [2 | ns | 
MASNHOZ | Data Bus Hi-Z after Address StobeHigh| 30 —+| 2 drs | 
B49 [tAVMANL [Address ValidtoReadStobelow [| —os'T-4 | ns | 
[B50 [IMANLMANH | ReadStrobeWidth Sid] ~ST-4eN'T | 8 | 
B51 [IMANHMANL |ReadStrobeHightoReadSwobelow | osT—2 | — ns | 
[B52 |(MANLOINV | ReadStrobeLowtoDATAinVaid | — | 1.5* T= 11 +N*T I ns | 
[B53 |1DINVMGNH | DATAinSetuptoReadStrobeHigh [10 | ns | 
[54 [IMANHDX | DATAinHold afterReadStobeHigh_ | —# | | 
[B55 [tMANHDE | Read Strobe HightoDataBustowZ | _T-4 | — ——~id; ns | 
[B56 [tRWVDE | Read-WriteAssertedtoDataBuslowz| 1-4 | ns | 
[B57 [tAVMWNE | Address ValidtoWrite’Stobelow | os'T-4 | — | rs | 
ese [IMWNLMWNH| Write Strobe width | T+MeT! Os | 
[B59 [tMWNHAX | Address Hold after Write SrobeHigh | os*T-4 [Ts 
B60 Data Bus Valid to Write Strobe High: ne eed 
0—1 Configured Wait-States T-2 
2—3 Configured Wait-States T-2+(M-1)°T 
B77 | tASNLDV Address Strobe Low to Data Bus Valid: P= | oer [| 
Ows T+2 ns 
1—3 ws 5°T+2 





* Address refers to A2—A31 and MSO—MSS. 
+ T=ICKILCKIL; N = number of wait-states (programmed plus external). 
$ M = Number of wait-states configured in pew. 
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READ (0 WAIT) READ (1 WAIT) WRITE (0 WAIT) WRITE (1 WAIT) IDLE 
CKI viM— 
>: 61) 
A2—A31, “clon St | age See = foes 
MSO—MS3 ire) 6, aa [Ea (=) RRR Gla) Co] me Ss fcr mA fm’ ef nes ee reer 0 0 
2 : Es @® 
cn REEL “SP en LAs L-#_ 
ASN von = [eo =e = 
oes eee Ge |: 
o_o (@ es |< le 
(63}) (633 658) 
See ae 3 seca eae eee 
LOCKN Yor ee 
te ee ee 
o et he ee 
vac Ei i 
rer sid + © 
MWN VOM | @-| 
voL— — — | -2 "Bet Sia & al 
|e iS sle Cle © 
poner el ae eer) eo 
VIL 
+| 6) 
+ = 
balactlig VIN PYKYKKKYKKYY YYYXXXXKYXY/ \XX +7 YYYYXXYX\X YYXYXXKYXXY/ \XX VVVVV V 
BGN VIL LVVVIVIVAVAVIVIV/VAN XX PAXXXXX XXXX XX\ XXXXKXX\ X\ PYXXXXXXKKXK XX\ LXXXXX L\ 


Figure 12-2 Synchronous Bus Interface Timing 
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CKI viM— 
A2—A31, vom 
MSO—MS3 vor— 
CSN vom— 
RW vOM— 

vill 
Do—D31 VOM — vim—= 

wit? 
AEN vin— 
DEN vin— 
DLE viM— 
SRDYN 





Figure 12-3 Asynchronous Bus Interface Timing 
° 


Table 12-6 Timing Relationships for Asynchronous Bus Interface Operation (See Figure 12-3) 


[ieee Doscription <= eal 
Address Enable Low to Address Low-Z 


tAENLAV: 50 pF Address Enable Low to Address Valid 
100 pF 
150 pF 


Address Enable High to Address High-Z 
Data Enable Low to Data Bus Low-Z 


Data Enable High to Data Bus High-Z 
Data Latch Enable Width 
B71 | tDLEHCKIL Data Latch Enable High to CKI Low when 
Asserted during State Ending Read Transaction 
(Bus Interface State Machine is in State E and 





* Address refers to A2—A31 and MSO—MS3. 
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READ OR WRITE READ OR WRITE 
cKI VviM—— 
A2—A31 
mso—Ms3) 0 YO“ 
CSN,ASN,RW,P 
BD,MRN, 
MWN,LockN YO“ 
VOH, 
Do—p31 voy voll D 
mie rraraaaa ae 
SRDYN — LXXXXXXXX\X\ 
BRN 
vi, 
vv 
BGN vio 
BGACKN o> 





Figure 12-4 Bus Arbitration Timing 
Table 12-7 Timing Characteristics for Bus Arbitration (See Figure 12-4) 


[Num | Symbol| Description S| «Min: [ Max [Unit | 

B73 | {CKILBISE | OKI Low to Bus Interface Signals Lowz (| _4 | — | ns_| 
[B74 | tCKILDZ | CKI High to Bus Interface SignalsHighz___| 412 [ns _| 
BGACKN Low to Bus Interface Signals Low-Z | sl oneal 
{BGANHBISZ | BGACKN High to Bus Interface Signals High-z_| -4 | 2 | ns_| 


* Bus Interface Signals are defined to be A2—A31, MSO—MS3,CSN,ASN,RW,PBD,MRN,MWN, and LOCKN. 
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ICK VIM 
ILD VIH — 
vil — 





Figure 12-5 Serial Input Timing 


Table 12-8 Timing Requirements for Serial Inputs (See Figure 12-5) 


[Symbol _[ __Description _| 


tICKHILDX_| Input Load Hold Time 
tDIVICKH __| Serial Input Data Setup 
tiCKHDIX___| Serial Input Data Hold 


* This device is static. However, the maximum period guaranteed by test is 200 ns. 





C1 

C2 

C3 

C4 

C5 Input Load Low Setup Time 
C6 | 

C7 

C8 
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OCK 


OLD 


DO 





Figure 12-6 Serial Output Timing 


Table 12-9 Timing Requirements for Serial Outputs (See Figure 12-6) 


[Num [Symbol “["___—iDescription __ [Max [Unit | 
[C9 | tOCKLOCKL | Serial Output Clock Period ___| p= | ns] 
Suns 
i a 
ju i i ares 

i SSSA 

eee [ane 


















cg 
tTOLDHOCKH _| Output Load High Setup Time 
TOLDLOCKH _| Output Load Low Setup Time 
tOCKHOLDX__| Output Load Hold Time 


* This device is static. However, the maximum period guaranteed by test is 200 ns. 





Table 12-10 Timing Characteristics for Serial Outputs (See Figure 12-6) 


TOCKHDOV 
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Table 12-11 Timing Requirements for Serial Clock Generation (See Figure 12-7) 

[Num] Symbol [Description [Min [Max Unit | 

a i 
(SYHOCKH 
tOCKHSYX 















tSYLOCKH 
tOCKHSYX 


* ICK or OCK as selected by |OC{1]. 





Table 12-12 Timing Characteristics for Serial Clock Generation (See Figure 12-7) 
[Num] Symbol [Description [Max 
C21 tICKHSYL | Internal SY Delay ley yao 









tOCKHSYL 


fie i 
ri helenae ae 
tOCKHOLDL 
tSYLOLDL 
tOCKHOCKH 
C25 | tiCKLICKH | Internal SIO Clock Low Time ICN *2*°T-10 
tOCKLOCKH 


C26 tICKHICKL | Internal SIO ClocksHigh Time ICN*2*T-10 | ICN*2*T+10 
tiCKHICKL 


* ICN refers to the value of the four bit field IOC[23—20]. When ICN-0, the period of the clock is infinity. (The clock is disabled from 
counting.) 
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ICK,OCK 


ICK,OCK 


sy 


ILD,OLD 


ICK,OCK 


SY 


ILD,OLD 


VOM 





a. internal ICK/OCK 


VIM - 


VOL 


VOL 


b. internal ILD/OLD/SY. external ICK/OCK 





VOL 


c. external SY. internal ILD/OLD 
external or internal ICK/OCK 


Figure 12-7 ac Serial Clock Generator Timing 
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CKI VIM = 














BIOO—7 vom = 
(outputs) Y— 
VIH — 

BIOO—7_—s vim 

(INPUTS) i 
SAMPLE 
WINDOW 

BIOO Viki 

(TIMER INPUT- 

ASYNCHRONOUS) 


Figure 12-8 Bit I/O Timing 


Table 12-13 Timing Requirements for Bit I/O (See Figure 12-8) 


[ Num |__Symbol | Description 





tBIOOHBIOOH | Timer Clock Period 


Table 12-14 Timing Characteristics for Bit I/O (See Figures 12-8) 


[Num [Symbol | Description 











[Min | 
tBIOVCKIL | BIO Setup Time fies Wi 4ion 
tCKILBIOX _| BIO Hold Time jrasris 2 ar] 





tBIOOLBIOOH [ Timer Clock Low Time 





[Max _| 
Eas 
Es 
a 
[ D4” | tBIOOHBIOOL [Timer ClockHigh Time | ~T+5 [ = | 
[ D5 _| | 
[Max __| 
|e ia 
a 


[Min] 
TCKILBIOV _| BIO Delay aa 
TCKILBIOX | BIO Hold —-— 
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CKI VIM — 


VOH— 


IACK[O—1]_ Vom 







VIH — 
IR[O—1]N Wie 
(SYNCHRONOUS) 
SAMPLE 
WINDOW 
IR[O—1]N vM oo — 
(ASYNCHRONOUS) 


Figure 12-9 Interrupt Timing 


Table 12-15 Timing Requirements for Interrupts (See Figure 12-9) 


[Num] Symbol [Description Min 
[ £2 | tCKILIRX | Interrupt Asserted Hold Time (Synchronous) __i{| 2 __—t 
| £3 | tIRLIRH | Interrupt Asserted Pulse Width (Asynchronous) __|_ T+5_| 
[L_E4 | 1 Se 
[Unit | 
[ns _| 





[Max | Unit | 
Sana] 
aay oe 
SS El 
jase re 1]. Sins] 


Table 12-16 Timing Characteristics for Interrupts (See Figure 12-9) 


[Num [Symbol __] Description [Min | Max__| 
tCKILIACKL | Interrupt Acknowledge Delay _[  — | 16 ‘| 
ie Des Bel 


tCKILIACKX_| Interrupt Acknowledge Hold ons | 
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Figure 12-10 RESTN Test Load Circuit 


Table 12-17 Timing Requirements for Reset (See Figures 12-10 to12-1 2) 


[Num] Symbol __| 
tBIOVRESTH 
tRESTHBIOX 


TRESTHZNH _| ZN High After RESTN High 
TRESTLRESTH | RESTN In Pulse Width 


* Only BIO[4:7] are latched into the device when RESTN makes a zero-o-one transition. 
+ ZNmay be assorted at any time. This maximum specification permits correct DSP3210 operation in the run 





i 











ae 

tRESTLBIOZ | Pi Sai 
tRESTLBISZ | Power-on RESTN Low to Bus Interface Signals High-Z ; — | 
| 


[ns | 

[ns | 

[ns | 

[_ns | 

RESTN High to Bus Interface Signals Low-Z [ns | 
ower-on RESTN Low to Output and Biput Signals High-zt | — | 9 | ns | 
E == [ns | 

[ns | 

[ns | 

[ns | 

[ns ] 














[ 1ZNLSOBZ | ZN Low to Outputand Biput Signals High-z_ | OT 18 

[“IZNHSOBE_| ZN High to Output and Biput Signalstow-z_ [| 18 

RESTN In Low to BIO High-Z (Inputs} T— | v7 | 
[Fie | tRESTILBISZ | RESTN In Low to Bus Interface Signals HighZ dT OAT + 12 





* Bus Interface Signals are defined to be A2A—A31, MSO—MS3, CSN, ASN, RW, PBD, MRN, MWN, and LOCKN. 
+ Outputs and Biputs are defined to be BRN, BGACKN, BIO[0—7], ICK, ILD, DO, OCK, OLD, SY, and |ACK[O—1]. 
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CKI PULL 
Vop 3Vv 









RESTN VOM — 
ZN ViH 
VIH __ 
BIO[0—7] vow, vim "= 
BUS 
INTERFACE eS 
SIGNALS 
OUTPUTS & VIH 
BIPUTS tim 
RESTN viM — 
ZN viM — 


BIO[4—7] vais 





Vit 
BUS VIH 
INTERFACE Vil — 
SIGNALS 
ViH —Y 
RR ate ests RK MRRON KXKKKKKKRKKKKRKRKKK 


Figure 12-12 Externally Controlled Reset and ZN Timing 
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Appendix A 


BOOT ROM CODE 


Program execution always begins with the instruction at location 0x0. This physical address is either in Boot 
ROM or external memory depending on the the state of pew[10], C/PN, when the DSP3210 is reset. Note that 
pew[10—13] are initialized during the reset state with the logic states applied to bio[7—4], respectively. If C/PN is 
initialized to a one, execution begins from the first location in the Boot ROM. If C/PN is initialized to a zero, 
execution begins from location zero of external memory A. 


When Computer mode is selected, the Boot ROM code is run on reset. The Boot ROM program uses pew(13], 
BRC, to select between two routines stored in the ROM. Table A-1 lists a summary of the operations performed 
by each routine, The routine selected when pew(13] = 0 is used to redirect the starting address of the DSP3210. 
The routine selected when pew[13] = 1 is used to load a program from a external 8-bit wide memory, such as an 
EPROM, and execute it. The EPROM must be 8-bits wide and be mapped to external memory address 
0x60000000. Figure A-1 shows the organization of data in the byte-wide EPROM when using little endian byte 
ordering. 

A listing of the assembly code is provided in Listing A-1. Note that the fill code at the end of the RON is for future 
expansion and should not be used. 


Table A-1 Operation of Boot ROM Routines 


a [cn OS ee 


(1) Enables interrupt 1. 

(2) Enters wait-for-interrupt mode, and when interrupt 1 is acknowledged’. 

(3) Changes memory map to Processor mode. 

(4) Disables interrupt 1. 

(5) Reads location 0 (external memory) for branch vector and writes back a zero to location 0. 

(6) Returns from interrupt by loading pcsh with branch vector so that the DSP3210 starts executing 
from the branch vector. 

(1) Read four bytes, organized as the least significant byte of the first four word addresses, from 
external memory starting at 0x60000000 into a 32-bit integer (define this as the count). 

(2) Read the next four bytes, organized as the least significant byte of the first four word 
addresses, from external memory starting at 0x60000010 into a 32-bit pointer (define this as the 
address). 

(2) If count is zero, branch to address. 

(3) If count is non-zero, read count number of bytes from external memory starting at 0x60000020 

and write them to address to form words and then branch to address. 

* For the code to operate correctly, the chip must enter the wait-for-interrupt mode before taking the interrupt. Note that the emr register 

enables interrupt 1 prior to the waili instruction. If an interrupt is asserted between the enabling of interrupt 1 and the wait instruction, the 
interrupt will be taken with the wrong memory map. The time from RESTN making a low-to-high transition to the program reaching the waiti 
instruction is 64 * T, where T is a CKI period. 
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Memory Bits 
Word Address |31 23 15 7 0} 


0x60000000 COUNT= 
0x60000004 
0x60000008 ADDRESS= 
0x6000000C 
0x60000010 PROGRAM= 


0x60000014 
0x60000018 
0x6000001C 
0x60000020 
0x60000024 
0x60000028 
0x6000002C 
0x60000030 





Figure A-1 EPROM Data Organization (Little Endian Byte Ordering) 
Listing A-1 Boot ROM Code 
r 


DSP3210 Boot ROM 
This code is mask programmed into the BOOT ROM of the DSP3210. 


SYNOPSIS: 


The DSP3210 is reset via the RESTN pin and comes up in Computer mode by setting PCW bit 10 (C/PN) to 1 via 
the BIO7 pin. In addition, PCW bit 13 is set to either a 0 or 1 via the BIO4 pin. PCW bit 13 is used to determine 
which boot procedure to execute. 


PCW Bit 13 = 0 Procedure: 


The DSP3210 sets up r22 to point to 0x5003000 which is the address of the BOOT ROM in Processor mode. 
Next, the EMR register is set to enable External Interrupt 1. The DSP3210 then sets bit 10 of the PCW register to 
0 without changing the other bits in PCW. Finally, the DSP3210 goes into Wait for Interrupt Mode with its memory 
map set for Processor mode. 


When the system host decides to start the DSP3210, it places a vector to the DSP3210 external memory boot 
procedure into physical location zero. The system host then pulses IREQ1 pin (External Interrupt 1 Request) on 
the DSP3210 and begins polling physical location zero. 


The DSP3210 acknowledges the interrupt via the IACK1 pin and vectors to location 0x50030078 which is in the 
BOOT ROM. The DSP3210 then clears the EMR register to prevent any further interrupts. Next, it loads the 
PCSH, pc shadow, register from location zero (which is now in external memory). 


A-2 





Appendix A 


The DSP3210 then writes a zero into location zero. Finally, the DSP3210 performs a return from interrupt which 
executes a NOP from the Shadow Instruction register followed by the instruction which was fetched from the 
vector supplied by the host in physical location zero. 

PCW Bit 13 = 1 Procedure: 


The DSP3210 fetches 8 words starting from external memory 0x60000000. The least significant byte (LSB) of the 
first four words make up the address where the DSP3210 will branch to upon completing the boot procedure. 


The LSB of the next four words contain the number of bytes which are to be copied from the location following 
these eight words into the memory range specified by the starting address. 


Upon completion of the copy operation, the DSP3210 transfers program control to the starting address. Note that 
the memory map remains in the computer mode. 
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r 
z VO Registers Defines 

el 

#define ps jord0 

#define emr __ior08 

#define spc ior10 

#define pew = ior12 

#define dauc_ iorl4 

r 

s PSEUDO Instructions 

7 . 
#define waiti spc =r0 

#define bkpt — spe = (short) rO 
#define reset spc = (byte) r0 


sect ".brom", TEXT 


r 
2 3210 Exception Vector Table 
sf 

Pegoto boot_begin /* Reset */ 


11 = (short) pew 

pegoto boot_error /* Bus Error */ 

nop 

Pcgoto boot_error /* \llegal Opcode */ 

nop 

Pcgoto boot_error /* Reserved */ 

nop 

pegoto boot_error /* Address Error */ 

nop 

pcgoto boot_error /* DAU Overflow/Underflow */ 
nop 

pegoto boot_error 7 NAN */ 

nop 

Pegoto boot_error /* Reserved */ 

nop 

pegoto boot_error /* External Interrupt 0 */ 
nop 

pegoto boot_error /* Timer */ 

nop 

Pegoto boot_error /* Reserved */ 
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nop 
pegoto boot_error /* SIO Input Buffer */ 
nop 
pegoto boot_error /* SIO Output Buffer */ 
nop 
pegoto boot_error /* SIO DMA Input Frame */ 
nop 
pegoto boot_error /* SIO DMA Output Frame */ 
nop 
pesh = (long) *r0 /* External Interrupt 1 */ 
emr = (short) r0 
ireturn 
nop 
boot_begin: 
12 = 10 <<| 0x6000 
11 & 0x2000 
if(eq) pcgoto system_boot 
13 = (ushort24) 0xE000 
do7{ 
14 = *r2++ 
nop 
"134+ = (byte) r4 
} 


15 =*0xE004 /" fetch number of bytes */ 
ri =*0xE000 /* fetch start address */ 
if(eq) pegoto boot_end /* check number of bytes | 
perl 
m=15-1 
do r5{ 
14 = *12++ 
nop 
*r3++ = (byte) r4 
} 


boot_end: 
goto r1+0 
nop 
system_boot: 
122 = 10 <<| 0x5003 
12 = (ushort24) 0x8000 
emr = (short) r2 
11 =r1 & (~0x0400) 
waiti 
pew =r 
*r0 = 10 
boot_error: 
bkpt —/* signal error */ 
Pegoto . 
nop 
/* Fill remaining locations of the Boot ROM with their address */ 


fitbits (. - boot_rom + 0x50030000) 
fitbits (. - boot_rom + 0x50030000) 
fitbits (. - boot_rom + 0x50030000) 
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2's Complement Integer - A method used in some 
systems to represent positive and negative integers. 
Positive integers are identical to standard binary 
numbers; however, negative integers are the 
complement of a standard binary number plus one. 


3-state - To place an output in the high-impedance 
state. 


A-law - A European standard for the compression and 
expansion of the dynamic range of a signal. 


Addressing Modes - Various common modes for 
addressing in the DSP3210 include immediate, 
memory direct, register direct, and register indirect. 


ALU - Arithmetic logic unit 


Arithmetic Logic Unit (ALU) - On-chip unit that 
performs arithmetic operations. 


Assembler - A program that translates symbolically 
represented characters into a form (binary) the 
computer can interpret. 


Bit I/O (BIO) - Input or output pins used for status and 
control. 


Boot ROM - Preprogrammed Read-Only Memory that 
contains instructions that initialize operation of the 
DSP3210. . 


Bus Interface - A general-purpose parallel data port 
that can be used for interfacing to microprocessor 
buses, I/O buses, memory, or memory-mapped 
devices. 


Byte - An 8-bit quantity that may appear at any 
address in memory. 


CA - Control arithmetic. 

CAU - Control arithmetic unit. 

CISC - Complex Instruction Set Computer. 
Companding - The process of compressing and 
expanding a signal. 


Control Arithmetic (CA) Instructions - Instructions 
executed in the CAU that perform address generation, 
integer data arithmetic and logic functions, instruction 
flow control, and data movement. 


Control Arithmetic Unit (CAU) - Execution unit that 
performs 16- or 32-bit, integer arithmetic for logic and 
control functions. 


DA - Data arithmetic. 


Data Arithmetic (DA) Instructions - 32-bit floating- 
point instructions that perform multiply/accumulate 
operations for signal-processing algorithms. DA 
instructions also contain special functions for 


Glossary 


performing data type and format conversions and 
rounding. 


Data Arithmetic Unit (DAU) - Primary execution unit 
that performs 32-bit floating-point arithmetic for signal- 
processing functions. 


Data Arithmetic Unit Control (DAUC) Register - A 
register that controls the type of conversions 
performed in the DAU on input and output data and 
controls rounding modes. 


DAU - Data arithmetic unit. 
DAUC - Data arithmetic unit control. 


Direct Memory Access (DMA) - High-speed data 
transfer operation where an I/O channel transfers 
information directly to or from memory. 


DMA - Direct memory access. 
DMAC - Direct Memory Access Controller 
EMR - Exception mask register. 


Exception Vector Table - Sixteen pairs of 32-bit 
words that control the program flow based on the 
source of an exception. 


Exception Vector Table Pointer (EVTP) - A register 
that locates the starting address of the exception 
vector table. 


Fast Fourier Transform (FFT) - A complex procedure 
for transforming a time-domain waveform into its 
frequency-domain components. 


FFT - Fast Fourier transform. 


Floating-Point Arithmetic - A method of arithmetic in 
the computer that keeps track of the binary point. 


Frame Boundary - The first time the output buffer is 
full following a high-to-low transition on the SY pin, 
provided IOC[15] is equal to zero at that time. Frame 
boundary is generated by the on-chip clock generator 
and is useful for certain TDM interfaces. 


Halfword - A 16-bit quantity that may appear at any 
address in memory. 


VO - Input/output. 


/O Control (1OC) Register - A register used to 
configure the serial I/O interface to external devices. It 
is used to set various I/O configurations, bit lengths, 
internal or external clock, and internal or external 
synchronization. 


IBUF - Input buffer. 


Immediate Mode - An addressing mode in which the 
operand is supplied by the instruction. 
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Indexed Mode - An addressing mode in which the 
address part of the instruction is modified by an 
auxiliary (index) register during the execution of that 
instruction. This mode is achieved with the instruction 
rD =rS +N followed by *rD. 


Input Buffer (IBUF) - A register used to accept serial 
input from an external device. 


Input Shift Register (ISR) - An input register for 
converting serial input data to parallel data. 


Interrupt - A means by which external devices may 
request service by the microprocessor. 


IOC - Input/output control. 
ISR - Input shift register. 


Link Editor - Software support tool used for 
maintaining libraries, defining sections of on-chip and 
off-chip memory, and recognizing holes in memory. 


Memory Direct Mode - An addressing mode in which 
the instruction contains the address of an operand in 
memory. 


Memory Indirect Mode - An addressing mode in 
which a memory location contains the address of the 
data, rather than the data itself. This mode is achieved 
with the instruction rD = MEMORY followed by *rD. + 


Memory Mapped I/O (MMIO) - Instruction referencing 
method for on-chip peripherals. 


Multimedia - The processing of multiple forms of data, 
including sound, speech, data, images, and video. 


OBUF - Output buffer. 
OSR - Output shift register. 


Output Buffer (OBUF) - A register used to pass data 
to the output shift register. 


Output Shift Register (OSR) - A register for 
converting parallel data from the output buffer into a 
serial data stream. 


PCW - Processor control word. 


Pipelining - Overlapping the execution of instructions 
to increase performance. 


Processor Status (PS) Register - Register that 
displays flags for the CAU, DAU, and V/O. 


Register Direct Mode - An addressing mode in which 
the instruction specifies a register that contains the 
operand. 


Register Indirect Mode - An addressing mode in 
which the instruction contains the name of a pointer 
register (rP) that contains the address of the operand. 


Register Mapped I/O (IO) - Instruction referencing 
method for DSP3210 core control registers. 


Relative Mode - An addressing mode in which the 
absolute address is obtained by means of address 
modification. Address modification is performed by the 
addition of a given number to the address part of an 
instruction known as the relative address. This mode 
is attained by moving the program counter (pc) to 
another CAU register and then performing a register 
indirect operation. In relative addressing, the address 
contained in the second byte of the instruction is 
added to the program counter's lowest eight bits plus 
one. 


RISC - Reduced Instruction Set Computer. 


Serial 1/O (SIO) - A group of registers that allows 
interfacing with other devices using few, if any, 
external chips. It converts serial input data and 
parallel output data into serial data. 


Simulator - A highly specific program that allows 
accurate simulations of the logical operation of the 
DSP3210 device. 


SIO - Serial input/output. 


Stack - An area of reserved memory used for storing 
the program counter and the contents of registers 
during a program interrupt. 


Time-Division Multiplexed (TDM) - A procedure for 
transmitting two or more signals over a common 
channel through the use of successive time intervals 
for different devices. 


Timer - Programmable counter. 
TSC - Timer/Status and Control 


Word - A 32-bit quantity that may appear at any 
address in memory. 


ut-law - A United States standard for compressing and 
expanding the dynamic range of a signal. 


uP - Microprocessor. 
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wait for interrupt 3-6, 4-6, 4-85, 7-3 
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CA Arithmetic/Logic Instructions 
















TD = [(short)] rS3+N 
D = 


IFT-OR 










Ee 
a0 | 
[iD =[(shonyDIN | nz00| on 
OR 
ARITH 
[a0 _| 


AND- 
COMPLEMENT 


1D = [hor] 1D #N 


rD = [(short)] rD >> N 


1D = [(short)] rD $>>N 


[if(COND)] rD = [(short)] rS1 + rS2 
{if(COND)] rD = [(short)] rS1 & rS2 
[iKCOND)] 1D = [(short)] rS1 &~ rS2 


[if(COND)] rD = [(short)] rS1 | rS2 


[i{COND)] 1D = [(short)] St *rS2 
[i{COND)] 1D = [(short)] St #rS2 


[i{COND)] 1D = [(short)] S1 << 1S2 
[if(COND)] 1D = [(short)] rS1 >> rS2 


[i{COND)] 1D = [(short)] St $>> 1S 


[i{COND)] [(shorty] St - 152 
[i{COND)] [(short)] S1 & S2 


[i{COND)] 1D = [(short)] rS/2 
[IKCOND)| 1D = [(short)] rS>>1 


{if(COND)] rD = [(short)] rS<<1 
[if(COND)] rD = [(short)] rS*2 


[if(COND)] rD = [(short)] rS>>>1 


[if(COND)] rD = [(short)] rS<<<1 


EXCLUSIVE 
OR 


ADD-CARRY 
REV 
SHIFT LEFT 


SHIFT RIGHT- 
ARITH 


ARITH 


SHIFT LEFT 


ROTATE 
RIGHT 


\OTATE LEFT 


i} 


R 
[if(COND)] 1D = [(short)] -rS 
[if(COND)] 1D = [(short)] rS | __ aod 
[if(COND)] D1 = [(short)] rS {+, -}1_| nzve 
[if(COND)} sp = (long) sp {++, -} 
* The size of arithmetic/logic and data move operations can be 


specified to be long (32-bits), or short (16-bits). Long is the 
default and does not have to be specified. 


=> alet 
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AT&T DSP3210 
Programmer’s Reference 
Card 


Note: { } and [] are not part of the instruction syntax. 
Parentheses () are part of the syntax and must appear 
where shown in an instruction. Lower-case letters are 
part of the syntax and upper-case letters are replaced 
by immediate data, register names, or expressions. 


Fla 

NZVU 
NZVU 

NZVU 


Replacement for DA Multiply/Accumulate Instructions 


| aN, aM__| 
ay i—14), (i—t4}e4, “II, 
*{ri—114}++{r15—119}, a0-a3 


















[Z=]aN = [aM (+lV'X 
aN = [aM {+,}(Z=Y) x 
[z=] aN =[lY (+ JaM"x 
[2-]aN = [YX 

aN = [-] (Z=Y)"X 
[Z=] aN = [JY (+1 
[Z=]aN =[1Y 

aN = [-] (Z=Y) {+} X 



















14}, {i—14}e+, “I—14)-, 
“(1 —114}++{r15—119) 









DA Special Function Instructions 


[[zJaN=ooy) | _— | oo __| 
(Ze]aN=floatta(yy | _Nzoo | FLOATI_| 
[Ze] aN = inte) a 
[[Zalan= ina) | _—_| 





ROUND 





[izalaN=itan(y) —_[_— | ALT _| 
[(Z=laN=ifaeqy) | — | _1FAEQ__| 
(ZalaN = ifagt¥) aa er 
[(ZlaN=cep(%) | Nav] DSP __| 
[iZ=JaN=ieeovy | — _] eee _| 

[see _] 


Restriction 1 - Z-pointer may only be used as Z-pointer in 
the following instruction. 

Restriction 2 - CAU or IO register store cannot follow a DA 
instruction with a Y field memory reference. 

Restriction 3 - CAU register loads cannot be referenced in 
the following instruction. 



















Replacement for DA Special Function Instructions 


Replace.) [55-00 5s Value = | 


“{r1—114}, *{ri—r14}+4, “{r1—114}--, 
“{ri—114}++{r15—119}, a0-a3 
(Y may not be a0—a3 for dsp function. 


ee | “(i—114}, “{—r14}+4, “{rI—r14}--, 
*(r1—114} ++{r1 5—119) 








xf 


if (COND) goto {N, rB, rB+N)° 
if (rM-->=0) goto {N, rB, rB+N}" GOTO-LOOP 
goto {N, rB, rB+N, M, rB+M}" GOTO 
nop NOP 

call {N, rB, rB+N, M} (rM)" CALL 
return (rM) RETURN 
{do,dolock} K, {L, rM} 

doblock {L, rM} 


GOTO-COND 


IRETURN 
SFTRST 





“When using pc as rB, goto and call instructions may be written 
with the syntax pegoto label or pecall label, respectively. The 
linker computes the appropriate N value to encode. ® 


aeq 


age 
alt 


ale 





Replacement for CA Arithmetic/Logic 
Instructions 














16-bit signed integer 
One of the DSP3210 Condition 
Codes 
CA Data Move Group Instructions 
ecco F-= 
rD= (short) N SET 
SET24 
=f STORE 
‘OAD 











rD= (ushort24) M 
{MEM, *L, iorD} = 
{(byte),(hbyte), (short), (long)} rS 
1D = {(char),(byte),(hbyte)} {MEM,"L} | nz00 
1D = {(short),(ushort)} {MEM,"L} nz0o 
1D = (long) {MEM, *L} nzoi 
TD = (char), (byte), (hbyte)} iorS nz00 | LOAD 
1D = {(short),(ushort)} iorS nz00}| LOAD 
1D = (long) iorS. nz00 | LOAD 


MEM = {(byte),(short),(long)} iorS | ee 
LOAI 
1OR 
** The size of data move operations can be specified to 
be long (32-bits), short (16-bits), or byte (8-bits). 


Long is the default and does not have to be 
specified. 











iorD = {(byte),(short),(long)} MEM 


Replacement for CA Data Move Instructions 


“{rO—r22}, *{rO—r22}++, *{rO—122}--, 


*{rO—122}++{r0—122} 


ior{0, 8, 10, 12, 14, 15} 


16-bit number 


eT aNEe 
16-bit number (unsigned) 
[Eee 


24-bit number (unsigned) 





Latency 1 - DA instruction writes may not be 
referenced until four instructions later. 

Latency 2 - An accumulator may not be used as a 
multiplier input until three instructions later. 

Latency 3 - goto, call, and return instructions are 
delayed by one instruction. 

Latency 4 - DAU conditions are established no sooner 
than four instructions prior to the test. 
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